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PROTEIN KINASES 



This application is a Continuation-in-Part of U.S. Application Serial 
No. 08/008,001, filed January 21, 1993, which is a Continuation-in-Part of U.S. 
Application Serial No. 728,783, filed July 3, 1991. 

5 FIELD OF THE INVENTION 

The present invention relates to novel polynucleotides encoding 
polypeptides which correspond to the class of protein kinase isolates heretofore 
referred to as casein kinase I and which possess protein kinase and/or DNA 
recombination/repair promoting functional capabilities. 

10 BACKGROUND OF THE INVENTION 

Protein Kinases 

The protein kinases comprise an exceptionally large family of 
eukaryotic proteins which mediate the responses of cells to external stimuli and 
are related by amino acid sequence homology within the so-called "catalytic 

15 domain" of the enzymes. To date, in excess of 100 unique membCTS of the 
protein kinase family from a wide variety of eukaryotic organisms have been 
described and characterized at the amino acid sequence level. See, e.g., Hanks, 
et ah {Science, 241:42-52, 1988) which presents a sequence alignment of 65 
protein kinase catalytic domains which range in size from about 250 to 300 amino 

20 acids and Hanks, et aL {Methods in EnzymoL, 2QQ:38-62, 1991) presenting a 
catalytic domain sequence alignment for 117 distinct protein kinase family 
members including a variety of vertebrate, invertebrate, higher plant and yeast 
species enzymes. The location of the catalytic domain within a protein kinase is 
not fixed. In most single subunit enzymes, tiie domain is near the carboxy 

25 terminus of the polypeptide while in multimeric protein kinases the catalytic 
domain takes up almost the entirety of the subunit polypeptide. 

Protein kinases are generally classified into a protein- 
serine/threonine subfamily or a protein-tyrosine subfamily on the basis of 
phosphorylation substrate specificity. Among the many classes of enzymes within 
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the protein-serine/threonine kinase subfamily are two distinct classes which have 
been designated casein kinase I and casein kinase II based on the order of their 
elution from DEAE-cellulose. The casein kinases are distinguished from other 
protein kinases by their ability to phosphorylate serine or threonine residues 
5 within acidic recognition sequences such as found in casein. Tuazon, et aL , (Adv. 
in Second Messenger and Phosphoprotein Res., 23:123-164, 1991) presents a 
review of over 200 publications related to casein kinase I and n, addressing the 
physicochemical characterization, recognition sequences, substrate specificity and 
effects on metabolic regulation for these two classes of enzymes. Casein kinase 

10 n is active as a heterotetramer and the complete amino acid sequences of human, 
rat, Drosophila and yeast species catalytic regions have been determined. Despite 
the fact that partially purified casein kinase I preparations have been obtained 
from cell nuclei, cytoplasm, and cell membranes of various plant and animal 
species, prior to the present invention, nothing was known concerning the primary 

15 structure of its enzymatically active monomeric subunit. 

As of the time of the present invention, therefore, there existed a 
significant need in the art for information concerning the primary structure (amino 
acid sequence) of protein-serine/threonine kinase enzymes of the casein kinase I 
class. Such information, provided in the form of DNA sequences encoding one 

20 or more of these kinases (from which primary structures could be deduced), 
would allow for the large scale production of kinases by recombinant techniques 
as well as for determination of the distribution and function of these enzymes, the 
structural distinctions between membrane-bound and non-membranous forms, the 
potential ligand-receptor interactions in which these kinases interact, and the 

25 identification of agents capable of modulating ligand-receptor binding, kinase, and 
other activities. 

DNA Recombination And Repair 

Chromosomes experience single-stranded or double-stranded breaks 
as a result of energy-rich radiation, chemical agents, as well as spontaneous 
30 breaks occurring during replication among others. Although genes present in the 
chromosomes undergo continuous damage, repair, exchange, transposition, and 
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splicing, certain enzymes protect or restore the specific base sequences of the 
chromosome. 

The repair of DNA damage is a complex process that involves the 
coordination of a large number of gene products. This complexity is in part 
5 dependent upon both the form of DNA damage and cell cycle progression. For 
example, in response to ultraviolet (UV) irradiation, cells can employ 
photoreactivation or excision repair functions to correct genetic lesions. The 
repair of strand breaks, such as those created by X-rays, can proceed through 
recombinational mechanisms. For many forms of DNA damage, the cell is 

10 induced to arrest in the G2 phase of the cell cycle. During this G2 arrest, lesions 
are repaired to ensure chromosomal integrity prior to mitotic segregation. 

Since the transfer of genetic information from generation to 
generation is dependent on the integrity of DNA, it is important to identify those 
gene products which affect or regulate genetic recombination and repair. Through 

15 the use of organisms with specific genetic mutations, the normal functional gene 
can be obtained, molecularly cloned, and the gene products studied. 

In eukaryotes such as Saccharomyces cerevisiae, genetic studies 
have defined repair-deficient mutants which have allowed the identification of 
more than 30 radiation-sensitive (RAD) mutants (Haynes, et aL, in Molecular 

20 Biology of the Yeast Saccharomyces, pp. 371, 1981; J. Game in Yeast Genetics: 
Fundamerual and Applied Aspects, pp. 109, 1983). These mutants can be 
grouped into three classes depending upon their sensitivities. These classes 
broadly define excision-repair, error-prone repair, and recombinational-repair 
functions. The molecular characterization of yeast RAD genes has increased the 

25 understanding of the enzymatic machinery involved in excision repair, as well as 
the arrest of cell division by DNA damage. 

The understanding of RAD genes and their expression products has 
become increasingly important as research continues to develop more effective 
therapeutic compositions. Often these new compositions appear quite effective 

30 against a particular disease condition, such as certain tumors, but prove to be too 
toxic for in vivo therapy in an animal having the disease. Indeed, these 
compositions can actually increase the likelihood of mutagenesis. 
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Most agents that are mutagenic or carcinogenic are in themselves 
unreactive, but are broken down to reactive intermediates in vivo. It is these 
reactive intermediates which interact with DNA to produce a mutation. This 
event is thought to be the initial step in chemical carcinogenesis. Mutations in a 
5 large number of genes affect the cellular response to agents that damage DNA, 
In all likelihood, many of these mutated genes encode enzymes that participate in 
DNA repair systems. Consequently, when the repair system is compromised, the 
cells become extremely sensitive to toxic agents. Although the DNA may revert 
to normal when DNA repair mechanisms operate successfully, the failure of such 

10 mechanisms can result in a transformed tumor cell which continues to proliferate. 

Although there are currently available tests to determine the toxicity 
or mutagenicity of chemical agents and compositions, there are limitations in both 
laboratory screening procedures and animal toxicity tests. These limitations 
include extrapolating laboratory data from animals to humans. There is often a 

15 large measure of uncertainty when attempting to correlate the results obtained in 
laboratory animals with effects in human subjects. In most cases, doses of the 
test drug have been used in the animal which are too high to be safely 
administered to humans. In addition, some types of toxicity can be detected if the 
drug is administered in a particular species, yet may be missed if the experiment 

20 is not done in the correct animal species. Moreover, many currently available 
laboratory tests are incapable of detecting certain types of toxic manifestations 
which occur in man. 

Phenotypic complementation, as a way of identifying homologous 
normal functional genes, is widely used. For example, the human homologue of 

25 the yeast cell cycle control gene, cdc 2, was cloned by expressing a human cDNA 
library in Schizosaccharomyces pombe and selecting those clones which could 
complement a mutation in the yeast cdc 2 gene (Lee, et aL, Nature, 327:31. 
1987). A mammalian gene capable of reverting the heat shock sensitivity of the 
RAS2^^^^ gene of yeast, has also been cloned by using complementation 

30 (Colicelli, et al, Proc.NariAcad.ScL USA, 86:3599, 1989). A rat brain cDNA 
library was used to clone a mammalian cDNA that can complement the loss of 
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growth control associated with the activated RAS2 gene in yeast. The gene, DPD 
(dunce-like phosphodiesterase), encodes a high-affinity CAMP phosphodiesterase. 

In summary, limitations and uncertainties of existing laboratory 
tests fail to provide an accurate method of examining the effects of a composition 
5 on DNA integrity. In view of this, a considerable need exists for screening 
methodologies which are inexpensive, rapid, and contain the relevant gene from 
the animal which is to be treated with the composition. Such methods provide a 
direct assay to determine if a composition interferes with the DNA repair system 
of a cell. 

10 SUMMARY OF THE INVENTION 

In one of its aspects, the present invention provides purified and 
isolated polynucleotides (e.g., DNA sequences and RNA transcripts thereoO 
encoding eukaryotic protein kinases of the casein kinase I class herein designated 
as "HRR25-likc" proteins and characterized by greater than 35% amino acid 

15 sequence homology with the prototypical yeast enzyme HKR25 through the protein 
kinase catalytic domain thereof. Polynucleotides provided by the invention 
include RNAs, mRNAs and DNAs, including antisense forms thereof. Preferred 
DNA sequences of the invention include genomic and cDNA sequences as well 
as wholly or partially chemically synthesized DNA sequences and biological 

20 replicas thereof. Specifically illustrating the invention are Saccharomyces 
cerevisiae DNAs including those encoding HRR25 and NUFl, 
Schizosaccharomyces pombe DNAs including those encoding Hhpl + and 
Hhp2'\-, and human DNAs including those encoding CKIalHu, CKIa2Hu, 
CKIa3Hu, CKI7IHU, CKI72HU, and CKISHu. Also provided are autonomously 

25 replicating recombinant constructions such as plasmid and viral DNA vectors 
incorporating such sequences and especially vectors wherein DNA encoding an 
flRR25-like casein kinase I protein is linked to an endogenous or exogenous 
expression control DNA sequence. 

According to another aspect of the invention, host cells, especially 

30 unicellular host cells such as procaryotic and eukaryotic cells, are stably 
transformed with DNA sequences of the invention in a manner allowing the 
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desired polypeptides to be expressed therein. Host cells expressing such 
HRR25-]ike products can serve a variety of useful purposes. To the extent that 
the expressed products are "displayed" on host cell surfaces, the cells may 
constitute a valuable immunogen for the development of antibody substances 
5 specifically immunoreactive therewith. 

Host cells of the invention are conspicuously useful in methods for 
the large scale production of HRR25'like proteins wherein the cells are grown in 
a suitable culture medium and the desired polypeptide products are isolated from 
the cells or from the medium in which the cells are grown. 

10 Also comprehended by the present invention are antibody 

substances (e.g., monoclonal and polyclonal antibodies, single chain antibodies, 
chimeric antibodies, CDR-grafted antibodies and the like) and other binding 
proteins which are specific for H/JR25-like proteins (i.e., non-reactive with 
protein kinase molecules which are not related by at least 35% homology with 

15 HKR25 through the protein kinase catalytic domain). Antibody substances can be 
developed using isolated natural or recombinant fl7f/?25-like proteins or cells 
expressing such products on their surfaces. The antibody substances are useful, 
in turn, for purifying recombinant and naturally occurring flKK2J-like 
polypeptides and identifying cells producing such polypeptides on their surfaces. 

20 The antibody substances and other binding proteins are also manifestiy useful in 
modulating (i.e., blocking, inhibiting, or stimulating) ligand-receptor binding 
reactions involving HRR25-like proteins. Anti idiotypic antibodies specific for 
anti-ffli/J2J-like antibody substances are also contemplated. Assays for the 
detection and quantification of ffly225-like proteins on cell surfaces and in fiuids 

25 such as serum and cytoplasmic fractions may involve a single antibody substance 
or multiple antibody substances in a "sandwich" assay format. 

Recombinant jHR/?25-like protein products obtained according to the 
invention have been observed to display a number of properties which are unique 
among the eukaiyotic protein kinases. As one example, the HRR25 protein 

30 possesses both protein-tyrosine kinase and protein-serine/threonine kinase 
activities. Moreover, HRR25 operates to promote repair of DNA strand breaks 
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at a specific nucleotide sequence and is the only protein kinase known to have 
such recombination/repair promoting activity. 

The DNA sequence information for yeast and mammalian 
(including human) species HRR25-like proteins which is provided by the present 
5 invention makes possible the identification and isolation of DNAs encoding other 
HRR25-like proteins by such well-known techniques as DNA/DNA hybridization 
and polymerase chain reaction (PCR) cloning. 

Recombinant flKK25-like proteins and host cells expressing the 
same are useful in screening methods designed to examine the effects of various 
10 compositions on DNA break repair and protein kinase activities of the proteins. 
Protein kinase inhibitory effects may be assessed by well-known screening 
procedures such as described in Hidaka, et al {Methods in Enzymology, 201:328- 
339, 1991). 



BRIEF DESCRIPTION OF THE DRAWING 

15 Further aspects and advantages of the present invention will be 

apparent upon consideration of the following detailed description of presendy 
preferred embodiments thereof, reference being made to the drawing wherein: 

Figure 1 (A) presents an alignment of the predicted amino add 
sequence of HRR25 with the catalytic domains of the yeast CDC28, yeast KSSl 

20 and human RAFl protein kinases. Figure 1(B) shows a schematic representation 
of the structure of HRR25, and 

Figure 2 presents an alignment of the predicted amino acid 
sequences of HRR25 with the sequences of three other Saccharomyces cerevisiae 
//i?R2J-like proteins (YCK1/CKI2, YCK2/CKI1, and NUFl), two HRIl25-]ikc 

25 proteins (Hhpl+ and Hhp2-^) from Schizosaccharomyces pombe and three 
putative isoforms (CKIalHu, CKlQf2Hu, and CKXofSHu) of a human ffl?/J25-like 
protein. 
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DETAILE» DESCRIPTION OF THE INVENTION 
In one of its aspects, the present invention relates to a DNA 
encoding a recombination/repair promoting polypeptide which can be used in an 
assay system to examine the effects of various compositions on DNA integrity. 
5 These functional sequences, which can be characterized by their ability to 
promote restoration of DNA strand breaks, permit the screening of compositions 
to determine whether a particular composition has an effect on the restoration of 
such repair activity. The invention also provides a DNA sequence encoding a 
polypeptide which promotes normal mitotic recombination, but is defective in 

10 protein kinase activity and essentially unable to repair DNA strand breaks. This 
defective DNA sequence is highly useful for identifying other DNA sequences 
which encode proteins with functional protein kinase activity. In addition, the 
present invention relates to the polypeptide encoded by the defective DNA 
sequence, as well as the polypeptide encoded by the functional wild-type DNA. 

15 In order to identify a DNA sequence encoding a polypeptide with 

protein kinase activity, a method is provided whereby a DNA library is screened 
for nucleotide sequences capable of restoring DNA strand break repair in a mutant 
lacking such activity. A method is further provided for identifying a composition 
which affects the activity of a mammalian polypeptide having protein kinase 

20 activity, wherein the polypeptide is capable of restoring DNA double-strand break 
repair activity in a mutant lacking such activity. 

In general, the defective protein kinase can be characterized by its 
ability to promote normal mitotic recombination, while being essentially unable 
to repair DNA double-strand break including that which occurs at the cleavage 

25 site: 

I 

CAACAG 

GTTGTC 
t 

30 The DNA double-strand breaks which the defective protein kinase is essentially 
unable to repair can be induced by various means, including endonucleases, x- 
rays, or radiomimetic agents including alkylating agents. Preferred endonucleases 
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are those which recognize the same nucleotide cleavage site as endonuclease HO. 
Radiomimetic alkylating agents having methylmethane sulfonate activity are 
preferred. Those of skill in the art will be able to identify other agents which 
induce the appropriate DNA strand breaks without undue experimentation. 
5 The present invention specifically discloses mutants sensitive to 

continuous expression of the DNA double-strand endonuclease HO, which codes 
for a 65 kDa site-specific endonuclease that initiates mating type interconversion 
(Kostriken, et aL, Cold Spring Harbor Symp. Quant.BioL , 49:S9, 1984). These 
mutants are important to understanding the functions involved in recognizing and 

10 repairing damaged chromosomes. This invention also discloses a yeast wild-type 
DNA recombination and repair gene called HRR25 (HO and/or radiation Aepair). 
Homozygous mutant strains, hrr25-l, are sensitive to methylmethane sulfonate 
and X-rays, but not UV irradiation. The wild-type gene encodes a novel protein 
kinase, homologous to other serine/threonine kinases, which appears critical in 

15 activation of DNA repair functions by phosphorylation. 

The HRR25 kinase is important for normal cell growth, nuclear 
segregation, DNA repair and meiosis, and deletion of HRR25 results in cell cycle 
defects. These phenotypes, coupled with the sequence similarities between the 
HRR25 kinase and the Raf/c-mos protein kinase subgroup suggest that HRR25 

20 might play a similar role in 5. cerevisiae growth and development. The defects 
in DNA strand break repair and the aberrant growth properties revealed by 
mutations in HRR25 kinase, expands the role that protein kinases may play and 
places HKR25 in a functional category of proteins associated with DNA 
metabolism. 

25 The development of specific DNA sequences encoding protein 

kinase polypeptides of the invention can be accomplished using a variety of 
techniques. For example, methods which can be employed include (1) isolation 
of a double-stranded DNA sequence from the genomic DNA of the eukaryote; (2) 
chemical synthesis of a DNA sequence to provide the necessary codons for the 

30 polypeptide of interest; and (3) in vitro synthesis of a double stranded DNA 
sequence by reverse transcription of mRNA isolated from a eukaryotic donor cell. 
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in the latter case, a double stranded DNA complement of MRNA is eventually 
formed which is generally referred to as CDNA. 

The novel DNA sequences of the invention include all sequences 
useful in providing for expression in prokaryotic or eukaryotic host cells of 
5 polypeptides which exhibit the functional characteristics of the novel protein 
kinase of the invention. These DNA sequences comprise: (a) the DNA sequences 
as set forth in SEQ. I.D. No. 1 or their complementary strands; (b) DNA 
sequences which encode an amino acid sequence with at least about 35% 
homology in the protein kinase domain with the amino acid sequences encoded by 

10 the DNA sequences defined in (a) or fragments thereof; and (c) DNA sequences 
defined in (a) and (b) above. Specifically embraced in (b) are genomic DNA 
sequences which encode allelic variant forms. Part (c) specifically embraces the 
manufacture of DNA sequences which encode fragments of the protein kinase and 
analogs of the protein kinase wherein the DNA sequences thereof may incorporate 

15 codons which facilitate translation of mRNA. Also included in part (c) are DNA 
sequences which are degenerate as a result of the genetic code. 

The term "conservative variation" as used herein denotes the 
replacement of an amino acid residue by another, biologically similar residue. 
Examples of conservative variations include the substitution of one hydrophobic 

20 residue such as isoleucine, valine, leucine or methionine for another, or the 
substitution of one polar residue for another, such as the substitution of arginine 
for lysine, glutamic for aspartic acids, or glutamine for asparagine, and the like. 

With the DNA sequences of the invention in hand, it is a routine 
matter to prepare, subclone, and express smaller DNA fragments from this or a 

25 corresponding DNA sequences. The term "polypeptide" denotes any sequence of 
amino acids having the characteristic activity of the mutant or wild-type protein 
kinase of the invention, wherein the sequence of amino acids is encoded by all or 
part of the DNA sequences of the invention. 

The polypeptide resulting from expression of the DNA sequence of 

30 the invention can be further characterized as being free from association with 
other eukaryotic polypeptides or other contaminants which might otherwise be 
associated with the protein kinase in its natural cellular environment. 
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Isolation and purification of microbially expressed polypeptides 
provided by the invention may be by conventional means including, preparative 
chromatographic separations and immunological separations involving monoclonal 
and/or polyclonal antibody preparation. 
5 In general, recombinant expression vectors useful in the present 

invention contain a promotor sequence which facilitates the efficient transcription 
of the inserted eukaryotic genetic sequence. The expression vector typically 
contains an origin of replication, a promoter, and a terminator, as well as specific 
genes which are capable of providing phenotypic selection of the transformed 

10 cells. The transformed hosts can be grown in fermentors and cultured according 
to techniques known in the art to achieve optimal cell growth. The polypeptides 
of the invention can then be isolated from the growth medium, cellular lysates, 
or cellular membrane fractions. 

The DNA sequences of the present invention can be expressed in 

15 vivo in either prokaryotes or eukaryotes. Methods of expressing DNA sequences 
containing eukaryotic coding sequences in prokaryotes are well known in the art. 
Biologically functional viral and plasmid DNA vectors used to incorporate DNA 
sequences of the invention, for expression and replication in the host cell are well 
known in the art. For example, DNA can be inserted in yeast using appropriate 

20 vectors and introducing the product into the host cells. Various shuttle vectors 
for the expression of foreign genes in yeast have been reported (Heinemann, et 
al , Nature, 340:205, 1989; Rose, et ah , Gene, 60:237, 1987). Those of skill in 
the art will know of appropriate techniques for obtaining gene expression in both 
prokaryotes and eukaryotes, or can readily ascertain such techniques, without 

25 undue experimentation. 

Hosts include microbial, yeast, insect and mammalian host 
organisms. Thus, the term "host" is meant to include not only prokaryotes, but 
also such eukaryotes such as yeast, filamentous fungi, as well as plant and animal 
cells which can replicate and express an intron-firee DNA sequence of the 

30 invention. The term also includes any progeny of the subject cell. It is 
understood that not all progeny are identical to the parental cell since there may 
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be mutations that occur at replication. However, such progeny are included when 
the terms above are used. 

Transformation with recombinzmt DNA may be carried out by 
conventional techniques well known to those skilled in the art. Where the host 
5 is prokaryotic, such as E. coliy competent cells which are capable of DNA uptake 
can be prepared from cells harvested after exponential growth phase and 
subsequently treated by the CaCl2 method using procedures well known in the art. 
Alternatively, MgCl2 or RbCl could be used in the reaction. Transformation can 
also be performed after forming a protoplast of the host cell. 

10 Where the host is a eukaryote, various methods of DNA transfer 

can be used. These include transfection of DNA by calcium phosphate- 
precipitates, conventional mechanical procedures such as microinjection, insertion 
of a plasmid encased in liposomes, spheroplast electroporation, salt mediated 
transformation of unicellular organisms or the use of virus vectors. 

15 Analysis of eukaryotic DNA has been greatly simplified since 

eukaryotic DNA can be cloned in prokaryotes using vectors well known in the art. 
Such cloned sequences can be obtained easily in large amounts and can be altered 
in vivo by bacterial genetic techniques and in vitro by specific enzyme 
modifications. To determine the effects of these experimentally induced changes 

20 on the function and expression of eukaryotic genes, the rearranged sequences must 
be taken out of the bacteria in which they were cloned and reintroduced into a 
eukaryotic organism. Since there are still many functions in eukaryotic cells 
which are absent in prokaryotes, (e.g., localization of ATP-generating systems to 
mitochondria, association of DNA with histones, mitosis and meiosis, and 

25 differentiation of cells), the genetic control of such functions must be assessed in 
a eukaryotic environment. Cloning genes from other eukaryotes in yeast has been 
useful for analyzing the cloned eukaryotic genes as well as other yeast genes. A 
number of different yeast vectors have been constructed for this purpose. All 
vectors replicate in E. coliy which is important for amplification of the vector 

30 DNA. All vectors contain markers, e.g., LEU 2, HIS 3, URA 3, that can be 
selected easily in yeast. In addition, these vectors also carry antibiotic resistance 
markers for use in E. coli. 
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Many strategies for cloning human homologues of known yeast 
genes are known in the art. These include, but are not limited to: 1) low 
stringency hybridization to detect shared nucleotide sequences; 2) antibody 
screening of expression libraries to detect shared structural features; and 3) 
5 complementation of mutants to detect genes with similar functions. 

For purposes of the present invention, protein kinases which are 
homologous can be identified by structural as well as functional similarity. 
Structural similarity can be determined, for example, by assessing amino acid 
homology or by screening with antibody, especially a monoclonal antibody, which 

10 recognizes a unique epitope present on the protein kinases of the invention. When 
amino acid homology is used as criteria to establish structural similarity, those 
amino acid sequences which have homology of at least about 35 % in the protein 
kinase domain with the prototypical HRR25 protein are considered to uniquely 
characterize polypeptides. 

15 Conserved regions of amino acid residues in HKR25 can be used 

to identify flKZf2J-like genes from other species. Conserved regions which can 
be used as probes for identification and isolation of ^r/UC25-like genes 
(homologues) include the nucleotides encoding amino acid sequences GPSLED 
(amino acids 86 to 91 in SEQ ID NO: 2), RDIKPDNFL (amino acids 127 to 135 

20 in SEQ ID NO: 2), HIPYRE (amino acids 164 to 169 in SEQ ID NO: 2), and 
SVN (amino acids 181 to 183 in SEQ ID NO: 2), for example. These conserved 
motifs can be used, for example, to develop nucleotide primers to detect other 
HRZE25-like genes by methods well known to those skilled in the art, such as 
polymerase chain reaction (PCR). 

25 When homologous amino acid sequences are evaluated based on 

functional characteristics, then a homologous amino acid sequence is considered 
equivalent to an amino acid sequence of the invention when the homologous 
sequence is essentially unable to repair (in the case of the repair defective mutant 
gene) or able to repair (in the case of the natural gene), DNA double-strand 

30 breaks, including that which occurs at a nucleotide cleavage site 
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CAACAG 

GTTGTC 
t 

5 and when the homologous amino acid sequence allows normal mitotic 
recombination. 

This invention provides screening methods whereby genes are 
cloned from plasmid libraries by complementation of a recessive marker. A 
recipient strain such as Saccharomyces cerevisiae is constructed that carries a 

10 recessive mutation in the gene of interest. This strain is then transformed with 
a plasmid, for example, pYES2 (Invitrogen, San Diego, CA) containing the wild- 
type genomic DNA or cDNA. The clone carrying the gene of interest can then 
be selected by replica plating to a medium that distinguishes mutant from wild- 
type phenotypes for the gene of interest. The plasmid can then be extracted from 

15 the clone and the DNA studied. Several yeast vectors allow the application of 
complementation systems to go beyond isolation of yeast genes. Genes from a 
wide variety of species can be isolated using these vectors. In such systems, 
DNA sequences from any source are cloned into a vector and can be screened 
directly in yeast for activities that will complement specific yeast mutations. 

20 In a preferred embodiment, the present invention uses a mutation 

in yeast, the hrr25 mutation, which was identified by sensitivity to DNA double- 
strand breaks induced by the HO endonuclease. The genomic DNA which 
complements this mutation was isolated by transforming the hrr25 strain with a 
DNA library and subsequently screening for methylmethane sulfonate (MMS) 

25 resistance. Alternately, functional genes from a variety of mammalian species can 
now be cloned using the system described. 

Yeast genes can be cloned by a variety of techniques, including use 
of purified RNA as hybridization probes, differential hybridization of regulated 
RNA transcripts, antibody screening, transposon mutagenesis, cross suppression 

30 of mutant phenotypes, cross hybridization with heterologous CDNA or 
oligonucleotide probes, as well as by complementation in £. coli. 
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Minor modifications of the primary amino acid sequence may result 
in proteins which have substantially equivalent or enhanced activity as compared 
to the sequence set forth in SEQ. I.D. NO. 2. The modifications may be 
deliberate, as by site-directed mutagenesis, or may be spontaneous by HRR25 
5 producing organisms. All of these modifications are included in the invention as 
long as HRR25 activity is retained. Substitution of an aspartic acid residue for a 
glycine acid residue at position 151 in the sequence shown in SEQ. I.D. NO. 2 
identifies the mutant hrr25. 

Antibodies provided by the present invention are immunoreactive 

10 with the mutant polypeptides and/or the naturally occurring protein kinase. 
Antibody which consist essentially of numerous monoclonal antibodies with 
different epitopic specificities, as well as distinct monoclonal antibody 
preparations are provided. Monoclonal antibody is made from antigen containing 
fragments of the polypeptide by methods well known in the art (Kohler,G. et aL , 

15 Nature 256:495, 1975; Current Protocols in Molecular Biology , Ausubel, F. et 
aL, ed.,1989). 

The invention also discloses a method for identifying a composition 
which affects the activity of a polypeptide having tyrosine kinase activity. The 
polypeptide is capable of promoting restoration of DNA double-strand break 

20 repair activity in host cells containing the hrr25 gene. The composition and the 
polypeptide are incubated in combination with host ceUs for a period of time and 
under conditions sufficient to allow the components to interact, then subsequently 
monitoring the change in protein kinase activity, for example, by decreased repair 
of DNA double-strand breaks. The DNA strand breaks are induced, for example, 

25 by a radiomimetic agent, such as methylmethane sulfonate, x-rays, or by 
endonuclease like HO, Other means of inducing double-strand breaks that are 
well known in the art may be employed as well. 

One embodiment of the invention provides a method of treating a 
cell proliferative disorder associated with or HKR25 or an fflJR25-like protein 

30 comprising administering to a subject with the disorder, a therapeutically effective 
amount of reagent which modulates an ffl?Z?25-like protein activity. The term 
"cell proliferative disorder" denotes malignant as well as non-malignant cell 
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populations which differ from the surrounding tissue both morphologically and/or 
genotypically. Such disorders may be associated, for example, with abnormal 
expression of HRR25 -like protein genes. "Abnormal expression" encompasses 
both increased or decreased levels of expression as well as expression of mutant 
5 forms such that the normal function of HRR25'like genes is altered. Abnormal 
expression also includes inappropriate temporal expression during the cell cycle 
or expression in an incorrect cell type. Antisense polynucleotides of the invention 
are useful in treating malignancies of the various organ systems. Essentially, any 
disorder which is etiologically linked to altered expression of HRR25-\ikc genes 

10 is a candidate for treatment with a reagent of the invention. "Treatment" of cell 
proliferative disorder refers to increasing or decreasing populations of malignant 
or non-malignant cells. 

As used herein, the term "modulate" envisions the suppression of 
HRR25-likc protein expression or the augmentation of expression. When a cell 

15 proliferative disorder is associated with HRR25-\ike gene overexpression, 
appropriate reagents such as antisense or binding antibody can be introduced to 
a cell. This approach utilizes, for example, antisense nucleic acid and ribozymes 
to block translation of a specific HRR25'likj^ protein mRNA, either by masking 
that mRNA with an antisense nucleic acid or by cleaving it with a ribozyme. 

20 Alternatively, when a cell proliferative disorder is associated with insufficient 
HRR25-likc protein, a sense polynucleotide sequence (the DNA coding strand) or 
HRR25-like polypeptide can be introduced into the cell by methods known in the 
art. 

As used herein, the term "therapeutically effective" refers to that 
25 amount of polynucleotide, antibody or polypeptide that is sufficient to ameliorate 
the HR/?25-associated disorder. "Ameliorate" denotes a lessening of the 
detrimental effect of the jFWC/?2J-associated disorder in the subject receiving 
therapy. 

Antisense nucleic' acids are DNA or RNA molecules that are 
30 complementary to at least a portion of a specific mRNA molecule (Weintraub, 
Scientific American, 262 :40. 1990). In the cell, the antisense nucleic acids 
hybridize to the corresponding mRNA, forming a double-stranded molecule. This 
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interferes with the translation of the mRNA since the cell will not translate a 
mRNA that is double-stranded. Antisense oligomers of about 15 nucleotides are 
preferred, since they are easily synthesized and are less likely to cause non- 
specific interference with translation than larger molecules when introduced into 
5 the target HRR25 producing cell. The use of antisense methods to inhibit the in 
vitro translation of genes is well known in the art (Marcus-Sakura, 
AnaLBiochem., 172:289, 1988). 

Ribozymes are RNA molecules possessing the ability to specifically 
cleave other single-stranded RNA in a manner analogous to DNA restriction 

10 endonucleases. Through the modification of nucleotide sequences which encode 
these RNAs, it is possible to engineer molecules that recognize specific nucleotide 
sequences in an RNA molecule and cleave it (Cech, 7. Amer, Med. Assn,, 
260:3030, 1988). A major advantage of this approach is that, because ribosomes 
are sequence-specific, only mRNAS with particular sequences are inactivated. 

15 There are two basic types of ribozymes namely, tetrahymena-typc 

and "hammerhead "-type. Terrahymena-type ribozymes recognize sequences which 
are four bases in length, while "hammerhead "-type ribozymes recognize base 
sequences 11-18 bases in length. The longer the recognition sequence, the greater 
the likelihood that sequence will occur exclusively in the target mRNA species. 

20 Consequently, hammerhead-type ribozymes are preferable to tetrahymena-typo 
ribozymes for inactivating a specific mRNA species and longer recognition 
sequences are preferable to shorter recognition sequences. 

The present invention also provides gene therapy for the treatment 
of cell proliferative disorders which are mediated by HRR25-likc polypeptides. 

25 Such therapy comprises introducing into cells of subjects having the proliferative 
disorder, the HRR25-]ikc antisense polynucleotide. Delivery of antisense 
polynucleotide can be achieved using a recombinant expression vector such as a 
chimeric virus or a colloidal dispersion system. Disorders associated with 
under-expression of HKR25 can similarly be treated using gene therapy with 

30 nucleotide coding sequences. 

Various viral vectors which can be utilized for gene therapy as 
taught herein include adenovirus, herpes vims, vaccinia, or, preferably, an RNA 
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virus such as a retrovirus. Preferably, the retroviral vector is a derivative of a 
murine or avian retrovirus. Examples of retroviral vectors in which a single 
foreign gene can be inserted include, but are not limited to: Moloney murine 
leukemia virus (MoMuLV), Harvey murine sarcoma virus (HaMuSV), murine 
5 mammary tumor virus (MuMTV), and Rous Sarcoma Virus (RSV), A number 
of additional retroviral vectors can incorporate multiple genes. All of these 
vectors can transfer or incorporate a gene for a selectable marker so that 
transduced cells can be identified and generated. By inserting an HRR25-like 
sequence of interest into the viral vector, along with another gene which encodes 

10 the ligand for a receptor on a specific target cell, for example, the vector is now 
target specific. Retroviral vectors can be made target specific by inserting, for 
example, a polynucleotide encoding a sugar, a glycolipid, or a protein. Preferred 
targeting is accomplished by using an antibody to target the retroviral vector. 
Those of skill in the art will know of, or can readily ascertain without undue 

15 experimentation, specific polynucleotide sequences which can be inserted into the 
retroviral genome to allow target specific delivery of the retroviral vector 
containing the HRR25-likc antisense polynucleotide. 

Since recombinant retroviruses are defective, they require assistance 
in order to produce infectious vector particles. This assistance can be provided, 

20 for example, by using helper cell lines that contain plasmids encoding all of the 
structural genes of the retrovirus under the control of regulatory sequences within 
the LTR. These plasmids are missing a nucleotide sequence which enables the 
packaging mechanism to recognize an RNA transcript for encapsidation. Helper 
cell lines which have deletions of the packaging signal include but are not limited 

25 to ^2, PA317 and PA12, for example. These cell lines produce empty virions, 
since no genome is packaged. If a retroviral vector is introduced into such cells 
in which the packaging signal is intact, but the structural genes are replaced by 
other genes of interest, the vector can be packaged and vector virion produced. 

Alternatively, NIH 3T3 or other tissue culture cells can be directly 

30 transfected with plasmids encoding the retroviral structural graes gag, pol and 
enVy by conventional calcium phosphate transfection. These cells are then 
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txansfected with the vector plasmid containing the genes of interest. The resulting 
cells release the retroviral vector into the culture medium. 

Another targeted delivery system for flK/C25-like antisense 
polynucleotides comprises a colloidal dispersion system. Colloidal dispersion 
5 systems include macromolecule complexes, nanocapsules, microspheres, beads, 
and lipid-based systems including oil-in- water emulsions, micelles, mixed 
micelles, and liposomes. The preferred colloidal system of this invention is a 
liposome- Liposomes are artificial membrane vesicles which are useful as 
delivery vehicles in vitro and in vivo. It has been shown that large unilamellar 

10 vesicles (LUV), which range in size from 0.2-4.0 ^^ni can encapsulate a 
substantial percentage of an aqueous buffer containing large macromolecules. 
RNA, DNA and intact virions can be encapsulated within the aqueous interior and 
be delivered to cells in a biologically active form (Fraley, et al. Trends Biochem. 
Sci., 6:77, 1981). In addition to mammalian cells, liposomes have been used for 

15 delivery of polynucleotides in plant, yeast and bacterial cells. In order for a 
liposome to be an efficient gene transfer vehicle, the following characteristics 
should be present: (1) encapsulation of the genes of interest at high efficiency 
while not compromising their biological activity; (2) preferential and substantial 
binding to a target cell in comparison to non-target cells; (3) delivery of the 

20 aqueous contents of the vesicle to the target cell cytoplasm at high efficiency; and 
(4) accurate and effective expression of genetic information (Mannino, et aL, 
Biotechniques, 6:682, 1988). 

The targeting of liposomes has been classified based on anatomical 
and mechanistic factors. Anatomical classification is based on the level of 

25 selectivity, for example, organ-specific, cell-specific, and organelle-specific. 
Mechanistic targeting can be distinguished based upon whether it is passive or 
active. Passive targeting utilizes the natural tendency of liposomes to distribute 
to cells of the reticulo-endothelial system (RES) in organs which contain 
sinusoidal capillaries. Active targeting, on the other hand, involves alteration of 

30 the liposome by coupling the liposome to a specific ligand such as a monoclonal 
antibody, sugar, glycolipid, or protein, or by changing the composition or size of 
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the liposome in order to achieve targeting to organs and cell types other than the 
naturally occurring sites of localization. 

The surface of the targeted delivery system may be modified in a 
variety of ways. In the case of a liposomal targeted delivery system, lipid groups 
5 can be incorporated into the lipid bilayer of the liposome in order to maintain the 
targeting ligand in stable association with the liposomal bilayer. Various linking 
groups can be used for joining the lipid chains to the targeting ligand. 

In general, the compounds bound to the surface of the targeted 
delivery system will be ligands and receptors which will allow the targeted 

10 delivery system to find and "home in" on the desired cells. A ligand may be any 
compound of interest which will bind to another compound, such as a receptor. 

The present invention will be better understood upon consideration 
of the following illustrative examples wherein: Example 1 addresses isolation of 
hrr25 mutant strains of Saccharomyces cerevisiae; Example 2 describes the 

15 isolation of HRR25 DNA by complementation screening; Example 3 is drawn to 
characterization of the DNA and putative amino acid sequence of HQRR25; 
Example 4 addresses microscopic analysis of HRR25 wild type and hrr25 mutant 
yeast morphology; Example 5 addresses the relationship of the amino acid 
sequence of HRR25 and three exemplary protein kinases which are not HRR25- 

20 like; Example 6 describes the isolation of DNAs encoding two 
Schizosaccharomyces pombe HRR25-like protein kinases; Example 7 is directed 
to isolation of DNA encoding another Saccharomyces cerevisiae protein, NUFl; 
Example 8 is drawn to isolation of DNAs encoding various eukaryotic species 
HRR25-like proteins including three human isoforms, CKIalHu, CKIa2Hu, and 

25 CKIaSHu; Examples 9 and 10 are respectively directed to determination of casein 
kinase and both serine-threonine kinase and tyrosine kinase activities for HRR25; 
Example 1 1 is drawn to the recombinant expression of HRR25 products and the 
generation of antibodies thereto; Example 12 relates to the isolation of human CKI 
isoforms, CKI7IHU and CKI72HU; Example 13 addresses isolation of another 

30 human isoform CKI5Hu; Example 14 describes complementation of yeast CKI 
mutants with human CKI isoforms; and Example 15 is directed to generation of 
monoclonal antibodies against peptide fragments of human CKIaHu isoforms. 
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The following examples are intended to illustrate but not limit the 
invention. While they are typical of those that might be used, other procedures 
known to those skilled in the art may alternatively be used. 

Rvam plp 1 

5 Isolation of hrr25 

5. cerevisiae strain K264-5B {MATa ho ura3 canl^ tyrl his? fys2 
adeS metis trp5 leul adeS) was employed for the mutant isolation. The yeast 
were transformed according to standard procedures with a 17IJ45-based integrating 
plasmid that contained a GALi, iO-regulated HO endonuclease and a transformant 

10 was mutagenized to approximately 50% survival with ethyl methanesulfonate 
(EMS), as described {Current Protocols in Molecular Biology, supra). The 
culture was spread onto glycerol-containing rich medium (YPG, to avoid petites), 
colonies were allowed to form at 30° C, and plates were replicated to glucose {HO 
repressing) and galactose {HO inducing) media. Mutants were identified by their 

15 inability to grow on galactose. Approximately 200 mutants were chosen for initial 
characterization and 62 maintained the gal- phenotype through repeated single 
colony purification. Among these, many were not complemented by various gal 
mutants. The remainder (25 mutants) were surveyed for overlapping DNA repair 
defects by determining sensitivity to ultraviolet (UV) irradiation and to methyl 

20 methane sulfonate (MMS). This screening method identified five alleles of known 
rod mutations and one new mutation. This new mutation hrr25-l {HO and/or 
radiation nepair), presented severe defects and was studied further. 

A recessive DNA repair defect is conferred by hrrlS-l that includes 
sensitivity to MMS. Hrr25-1 strains also show sensitivity at 5-20 Krad X- 

25 irradiation similar to that observed with mutations in the radiation repair genes 
RAD50 and RAD52 (Cole, et al, MoLCelLBioL, 9:3101, 1989). The hrr25'l 
strains are no more sensitive to UV irradiation than wild type and are not 
temperature sensitive for growth at 37 "^C. Unlike hypo- and hyper-rec rod 
mutants which have several of the hrr25'l phenotypes, hrr25'l strains undergo 

30 normal mitotic recombination (Cole, et aL, MoLCelLBioL, 2:3101, 1989). 
Sp>ontaneous gene conversion and crossing-over were the same for homozygous 
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hrr25-l and wild type strains. However, HRR25 is required for the correct 
completion of meiosis. The hrr25-l homozygotes showed less than 1 % spores 
(tetranucleate cells) under conditions that produced 75-80% spores in an isogenic 
wild type strain. The hrr25'l mutation could be complemented by a number of 
5 radiation sensitive mutations (radd, 50, 52, 54, and 57) that present some of the 
hrr25 phenotypes, suggesting that hrr25'l is a newly uncovered ra^-like mutation 
and not one of these previously described genes. These results also indicate that 
HRR25 plays a role in DNA repair and meiosis, but is not specifically required 
for the repair of spontaneous mitotic lesions by recombination. 

10 Example 2 

Isolation of HRR25 
The HRR25 gene was obtained by complementing for MMS 
sensitivity using a yeast genomic library constructed in the plasmid YCp50 (Rose, 
etal. Gene, 60:237, 1987). An hrr25-l strain, MHML 3-36d (ura3 hrr25), was 

15 transformed by standard methods (Nickoloff, et aL, J.MoLBioL, 207:527, 1989) 
to uracil prototrophy, transformants were amplified on media without uracil and 
replicated to media containing 0.01 % MMS. Among 1200 transformants, a single 
MMS resistant isolate was identified. Complementation for MMS sensitivity was 
found to segregate with the plasmid as determined by methods known in the art. 

20 A 12 kb genomic fragment was identified and complementing 

activity was localized to a 3, 1 kb BamHl-Sall fragment by transposon mutagenesis 
and subcloning. This region complemented DNA repair defects as well as meiotic 
deficiencies. Gene targeting experiments linked this cloned region to hrr25'l. 
Transposon insertion mutations within the BamRl-Sall fragment replaced into the 

25 cognate HRR25 genomic locus did not complement hrr25-l for MMS sensitivity, 
whereas adjacent chromosomal insertions outside the complementing region 
segregated in repulsion when crossed against hrr25'l. 

Mini-TniOLt/AT transposons (Huisman, et aL, Genetics, 116:191, 
1987) were used to delineate the approximate location of HKR25 on the 12 kb 

30 BamBl'Saa fragment. Insertions located to the left hand 9 kb (of the 12 kb 
genomic fragment) did not inactivate complementation of hrr25-l MMS resistance 
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compared with the un-mutagenized plasmid. Two insertions, located near an 
EcoRV site in the right hand 2 kb inactivated complementation. HRR25 
complementation activity was localized to a 3.4 kb Sail fragment. Approximately 
300 bp of this fragment (right hand side of the 12 kb) were part of the pBR322 
5 tetracycline resistance gene (between the BamHl site of PBR322-based YCp50). 
The HRR25 open reading frame spans an internal region across an EcoRV site and 
two BgUl sites within the right terminal 3 kb. 

The DNA sequence of the 3.1 kb fragment revealed a centrally 
located open reading frame of 1482 nucleotide. A transposon insertion mutation 

10 in this open reading frame inactivated HRR25 complementation whereas insertions 
elsewhere in the 12 kb clone did not affect HRR25 complementation. Transposon- 
mediated disruption of HRR25 also revealed several phenotypes not seen with 
hrr25'L As expected, a Tn70-based LUK transposon insertion (Huisman, et ah , 
Genetics, 116:191, 1987) into the middle of plasmid-bome HRR25 coding region 

15 inactivated complementation for MMS sensitivity. Transplacement of this 
insertion into the genomic HKR25 gene revealed a severe growth defect in 
addition to MMS sensitivity and meiotic inviability. This severe growth defect 
was not observed with hrr25-l strains. Wild type HRR25 strains doubled in rich 
media at 30 °C every 80-90 minutes whereas isogenic hiT25\\LUK strains and 

20 hrr25A doubled every 9-12 hours. hrr25'l had a doubling time of 2-4 hours. 

To determine whether the mutant phenotypes revealed by the 
hrr::LUK disruption allele represent a null phenotype, the entire HRR25 coding 
sequence was deleted. Briefly, deletion of the HRR25 coding sequence employed 
a hisG::URA3::hisG cassette (Alani, et al. Genetics, 116:541, 1988). The 3.1 

25 kb HRR25 Sail fragment was cloned into pBluescript (Stratagene, La Jolla, CA). 
This plasmid was digested with BgllL and the two BglJl fragments that span the 
entire HRR25 gene and its flanking sequences were deleted. Into this deletion was 
introduced the 3.8 kb BamHI-i3^/II hisG::URA3::hisG fragment from pNKY51 to 
create the hrr25A allele. Sail digestion yielded a linearized fragment that deleted 

30 the entire HRR25 locus. Yeast carrying the deletion-disruption allele (hrr25A) 
showed phenotypes identical to those with the hrr25::LUK allele for all properties 
examined, including MMS sensitivity, slow growth, and the sporulation defect, 
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indicating that wild-type HRR25 protein is associated with these processes and that 
the hrr25::LUK allele does not indirectly interfere with DNA repair, growth or 
sporulation. In direct parallel comparisons, the hrr25::LUK and hrr25A alleles 
behaved identically. 

5 Yeast strain MFH14 (MATa/MATa ura3/ura3) was transformed 

with fi^/n-linearized YCp50-fflf/?25::LUK to uracil prototrophy, heterozygous 
disruption of HKR25 was verified by Southern blot analysis, the diploid was 
sporulated by starvation for nitrogen and fermentable carbon sources, tetrads 
dissected and cells allowed to germinate at 30 °C for 7 days. After a normal 

10 germination period of 2 days, the severe growth defect of /zrr2J: :LUK suggested 
that the deletion of HRR25 was lethal. However, microscopic examination of 
scgregants revealed that hrr25: iLXJK germinating cells grew slowly and in every 
case examined (20/20 tetrads), slow growth, MMS sensitivity, and uracil 
prototrophy co-segregated. A color variation was seen with diploid MFH14 

15 segregants, due to mutations in adenine biosynthesis. MFH14 is ade5/ADE5 
ade2/Qde2, An ade5/ade2 strain was white, while an ADE5/ade2 strain was red. 

Example 3 

Sequence and Structure of the HRR25 GENE 
DNA sequencing of both strands of the HRR25 gene was done by 
20 uni-directional deletions employing Sequenase (USB, Cleveland, OH) and Exo- 
Meth (Stratagene, La Jolla, CA) procedures as described by the manufacturers. 
DNA and deduced amino acid sequences are set out respectively in SEQ. I.D. 
NOs. 1 and 2. Figure lA, shows the alignment of the amino acid sequences for 
HRR25, CDC28, KSSl, and RAFl. Figure IB shows a schematic representation 
25 of the structure of HRR25. The protein kinase homology is represented by a 
shaded region while the P/Q rich region is indicated by cross-hatchings. The 
mutant, hrr25, can be distinguished from HRR25 by one amino acid substitution. 
At position 151, an aspartic acid is substituted for glycine. 

The predicted translation product of HRR25 revealed an unexpected 
30 feature for a rad-^kc DNA repair function. HRR25 contains the hallmark 
signatures of sequence homology with the catalytic domain of serine/threonine 
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protein kinase superfamily members (Hanks, et ah , Science, 241 :42, 1988). For 
comparison, the HRR25 translation product was aligned with the catalytic domains 
for two subgroups of yeast protein kinases, the CDC28/cdc2 group and the 
KSS1/FUS3 group. Located between amino acids 15 and 30 is a region that 
5 contains the conserved GXGXXG region. Just C-terminal to this region is a 
conserved lysine and glutamic acid present in most known kinases. These regions 
are thought to function in the nucleotide binding and phosphotransfer steps of the 
kinase reaction (Hanks, et ah. Science, 241:42,1988). Between amino acid 
residues 120 to 150 are regions containing the HRD and DFG motifs, also found 

10 in most protein kinase family members. In addition, sequence examination of all 
known serine/threonine kinases indicates that HRK25 shares some additional 
similarities with the Raf/PKS/mos subgroup (Hanks, et al. Science, TAVAl, 
1988). The strongest homologies can be found in areas around the GXGXXG, 
DFG, and DXXSXG conserved regions in protein kinase catalytic domains. 

15 The functional relevance of the observed sequence similarity 

between HRR25 and protein kinases was studied by altering specific residues 
within the HRR25 kinase domain and examining the phenotypic consequences of 
these changes. A lysine at position 38 (Lys ^^) was mutated to an arginine 
residue by site directed mutagenesis, by methods known in the art. The 

20 mutagenic oligonucleotide SEQ. LD. NO. 22 was: 

5 '-CCTGATCGATTCC AGCCTGATCGCTACTTCTTCACC ACT-3 ' . 

Lys^^ in HRR25 corresponds to the lysine found in all known protein kinases, and 
this subdomain is involved in ATP binding. Mutations at the conserved lysine in 
protein kinases such as v-src, v-mos, and DBF2 inactivate these proteins. The 

25 mutant hrr25-l.ys^^ allele was incapable of complementing hrr25'l, hrr25::LUK, 
and hrr25A alleles for all properties examined, an indication that the HRR25 
kinase domain is required for in vivo function of HRR25. 

The predicted HRR25 translation product (SEQ. I.D. NO. 2) has 
a number of notable features outside the region of homology to protein kinase 

30 catalytic domains. For example, the last 100 amino acids is proline and glutamine 
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rich, containing 50 of these residues. Other proteins with regions rich in these 
two amino acids include the transcription factors Spl, jun, and HAP2, steroid 
hormone receptors, the 5. pombe rani kinase, and mofc-male germ cell-associated 
kinase (Courey, et aL, Cell, 55:887, 1988; Bohmann, et aL, Science, 238:1386, 
5 1987; Roussou, et al, MoLCell.BioL, 8:2132, 1988; Arriza, et al, Science, 
237:268, 1987; Matsushime, etaL, MoLCelLBioL, 10:2261, 1990). In the case 
of Spl and jun, the proline-glutamine regions are involved in transactivation, 
whereas the P/Q region in the human mineralocorticoid receptor is thought to 
serve as an intramolecular bridge. This proline-glutamine region in HRR25 might 

10 function as a structural feature for substrate interaction, or for subcellular 
localization. Also, the glutamine richness of this region is similar to the opa or 
M-repeat seen in the Drosophila and Xenopus Notch/Xotch proteins (Wharton, et 
al. Cell, 40:55, 1985; Coffman, et al. Science, 242:1438, 1990). The function 
of the opa repeat is not certain, but it is found in several Drosophila genes. 

15 Lastly, the sequence TKKQKY at the C-terminal end of the region homologous 
to protein kinases is similar to the nuclear localizing signal of SV40 large T 
antigen and yeast histone H2B (Silver, et al., J.CelLBioL, 109:983, 1989; 
Moreland, et aL, MoLCelLBioL, 7:4048, 1987). 

Example 4 

20 Microscopic Analysis of Germinating and 

Proliferating hrr25 Cells 
Photomicrographs of HRR25 and hrr25::LJJK colonics were taken 
after germination on rich medium. An MFH14 hrr25::LUK heterozygous 
transformant was dissected onto a thin film of YPD rich medium on a sterilized 

25 microscope slide and segregants were allowed to germinate under a coverslip by 
incubating the slide in a moist 30 "^C chamber. Photographs of colonies were 
taken after 2 days of growth. Phase contrast and DAPI staining of proliferating 
HRR25A and hrr25::LUK cells were compared. Cells were inoculated into YPD 
rich medium and grown at 30°C to a mid-log density of 1-3 X 10^ cells/ml, 

30 briefly sonicated to disrupt clumps, fixed with formaldehyde, and stained with 
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DAPI (Williamson, et aL, Meth.CelLBioL, 12:335, 1975). Many cells with 
hrr25::LUK lacked DAPI stainable nuclei. 

Microscopic examination of germinating and actively growing mid- 
log phase hrr25::LUK cells revealed aberrant cellular morphologies. Transposon 
5 disruption of HRR25 resulted in large cells, and 25-40% of cells were filamentous 
or extended. DAPI nuclear staining (Williamson, et al , Meth, CelhBioL , 12:335, 
1975) of mid-log populations showed that orderly cell cycle progression in hrr25 
mutants was lost. There were a large number of cells lacking DAPI-stainable 
nuclei which, by single cell manipulations proved to be inviable. Consistent with 

10 this nuclear segregation defect, the plating efficiency of /irr25; .iZ/AThaploids was 
also reduced to 75-80% of wild type. However, this reduction in plating 
efficiency is insufficient to account for the severe growth rate reduction. Plating 
efficiency was measured from mid-log phase cells by comparing the efficiency of 
colony formation on rich medium relative to the total number of cells determined 

15 by hemocytometer count. Cell populations were analyzed for DNA content 
distribution by flow cytometric analysis following staining with propidium iodide 
as described (Hutter, et al, J, Gen. Microbiol, 113:369, 1979). Cell sorting 
analysis showed that a large number of the cells in a haploid hrr25::LUK 
population were delayed in the cell cycle and exhibited G2 DNA content, but the 

20 population was not arrested uniformly in the cell cycle. 

Example 5 

Sequence Comparison of HRR25 with 
CDC28, KSSl. and RAFl 
The predicted translation product of HRR25 (SEQ. I.D. NO. 2) was 
25 compared with the catalytic domains of several members of the serine/threonine 
protein kinase superfamily. Initial sequence comparisons employed the UWGCG 
programs (Devereux, et aL, Nuc.Acids.Res., 12:387, 1984), whereas subgroup 
comparisons used the methods of Hanks, et al. , supra. HKR25 contains all eleven 
subdomains described by Hanks, et aL, supra. Structurally similar groupings 
30 were compared in the sequence comparisons. These included nonpolar chain R 
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groups, aromatic or ring-containing R groups, small R groups with near neutral 
polarity, acidic R groups, uncharged polar R groups, and basic polar R groups. 

CDC28 and KSSl represent members of two subgroups of 
serine/threonine protein kinases in yeast. CDC28 is involved in cell cycle 
5 regulation while KSSJ acts in the regulation of the yeast mating pathway. HRR25 
shows 21% identity and 41% similarity to CDC28 and 19% identity and 43% 
similarity to KSSl (Figure lA). HRR25 shows highest similarity to members of 
the Rafl/PKS/Mos family of protein kinases. Through the catalytic domain, 
HRR25 shows 30% identity and 49% similarity to Rafl, 



10 Example 6 

Identification. Isolation, and Analysis of 
Sc. pombe Hhpl-^ and Hhp2-^ Genes 



A. Isolation of the Hhpl-^ and Hhp2-^ Genes 

The clones were isolated by a two-pronged approach: i) DNA- 

15 based screening methods; and ii) direct complementation in S. cerevisiae hrr25 
mutant strains. Two genes were identified {Hhpl+ and Hhp2'\- - so named for 
HRR25 Homologue from Schizosaccharomyces ppmbe). Expression of Hhpl-^ in 
5. cerevisiae hrr25 mutants fully rescued all mutant defects. Expression of 
Hhp2-\- in 5, cerevisiae also rescued, to varying degrees, the defects associated 

20 with hrr25 mutations. 

DNA-based amplification of HRR25-like DNAs from 5c. pombe 
genomic and CDNA sequences prepared according to Fikes, et aL {Nature^ 
346:291-293, 1990) was conducted using polymerase chain reaction with the 
following partially degenerate oligonucleotide primers: 



25 



(1) Primer No. 4583 (SEQ. ID. NO. 13) representing top 
strand DNA encoding residues 16 through 23 of HRK25\ [1 
nmol/5 Ml], = 52^C; 
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(2) Primer No. 4582 (SEQ. ID. NO. 14) representing top 
strand DNA encoding residues 126 through 133 of HRR25; 
[1.5 nmol/5 /xl], = 54°C; 

(3) Primer No. 4589 (SEQ. ID. NO. 15) representing bottom 
strand DNA encoding residues 126 through 133 of HRR25; 
[0.5 nmol/5 ^1], = 54°C; 

(4) Primer No. 4590 (SEQ. ID. NO. 16) representing bottom 
strand DNA encoding residues 194 through 199 of HRR25; 
[2 nmol/5 ^1], = 38^C. 



10 Two series of amplifications were conducted using Perkin Elmer 

Automated apparatus; a first series using ffiU?25-based primer Nos. 4583 and 
4589 and a second series employing all four of the primers. In the first series, 
30 cycles of denaturation (94 °C, 1 min), annealing (48 "^C, 1 min), and extension 
(66*^0, 3 min) were performed and in a final cycle, the extension time was 

15 extended to 5 min. Reaction products were sized on an agarose gel revealing a 
prominent band of the expected size of about 306 bp. In the second series of 
amplifications, 30 cycles were carried out as above except that annealing and 
extension were carried out at 35°C and 60*^0, respectively. Three major products 
of the expected sizes (513 bp, 180 bp, and 306 bp) were developed in both 

20 genomic and CDNA libraries and were purified by preparative agarose gel 
electrophoresis. 

Products were cloned into M13mpl9 and sequenced by the dideoxy 
method (Maniatis, et ah , Molecular Cloning: A Laboratory Manual, 1982). Two 
classes of sequences were identified. A representative clone ft-om each class was 
25 radiolabelled with ^^P by random primed cut labeling to a specific activity of 10^ 
cpm//xg (Maniatis, et aL , supra) and used as a hybridization probe to isolate fiill 
length CDNA clones and to prove yeast genomic DNA in Southern blots and total 
RNA on Northern blots. Hybridization was carried out for 16 hours in a buffer 
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containing 6 x SSPE, 0. 1 % SDS, 5% dextran sulfate. Two genes were identified 
and designated Hhpl + and Hhp2+ for HRJR25 Homologues from Sc. ^ombe. 

For Hhp] + , 7 clones were identified (6 partial and 1 full length 
clone). For Hhp2+, 2 full length clones were identified. Both Southern and 
5 Northern analysis confirmed that these clones were from separate genes. These 
genes were sequenced using standard dideoxy method (Maniatis, et aL, supra). 
The nucleotide and deduced amino acid sequences for Hhpl + are set out in SEQ. 
ID. NOS. 3 and 4; the nucleotide and deduced amino acid sequences for Hhp2+ 
are set out in SEQ. ID. NOS. 5 and 6. 

10 B. Functional analysis of Hhpl -f- and Hhp2-\- in S, cerevisiae hrr25 mutants . 

5c. pombe Hhpl+ and Hhp2-\- cDNAs were cloned in a location 
which placed them under the control of the 5. cerevisiae alcohol dehydrogenase- 1 
(ADHl) promoter in a URA3-based vector pDB20 to allow for expression in S. 
cerevisiae (Fikes, et aL, supra). These resulting clones were analyzed for their 

15 ability to alter/modify the suppress phenotypes associated with the hrr25-l 
mutation and the hrr25A mutation following transformation into appropriate yeast 
strains by standard methods (Ito, et aL, J. BacterioL 153:163, 1983). 
Transformants were analyzed for their ability to overcome defects associated with 
the hrr25 mutations (Hoekstra, et aL, Science, 253:1031, 1991). Hhpl-^ 

20 expression fully complemented /zrr25-associated defects and was indistinguishable 
from wild type HRR25 in all analyses. Complementation was analyzed for the 
effect on DNA repair, cell cycle progression, cellular morphology, and 
sporulation. Hhp2'h complemented to a lesser degree than Hhpl+ (its 
complementation level was 50% -75% that of bona fide HKR25). The alteration 

25 of /2rr25-associated phenotypes was dependent upon the transformed yeast strains 
containing both a complementing Sc. pombe Hhp plasmid and having hrr25 
mutations. 

The degree of amino acid homology between HRR25 protein and 
Hhpl-k- protein is 73% through the kinase domain. The degree of similarity, 
30 which considers the presence of similar as well as identical amino acids, is greater 
than 85 % . The amino acid identity of HRR25 protein and Hhp2'¥ protein is 63% 
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with a percent similarity score of 80%. The intraspecies comparison of Hhpl-¥ 
protein to Hhp2-\- protein is 72% identity. This structural and complementation 
analysis clearly indicates that these Sc. pombe clones are functional homologues 
of the S. cerevisiae HRK25, Such a high degree of relatedness is not seen with 
5 any other group of protein kinases. As a measure of comparison here, bona fide 
functional homologues (i.e., cdc2 protein kinases from S. cerevisiae. Sc. pombe, 
and humans) show 40%-45% identity. Any two randomly compared protein 
kinases, regardless of whether the comparison is inter-or intra-species show a 
degree of identity of about 20% -25%. 

10 C. Disruption and mutation of Hhpl+ and Hhv2+ in Sc, pombe 

Mutations that inactivate or reduce the protein kinase activity of 
HRR25 in 5. cerevisiae result in a wide variety of phenotypes including: 
sensitivity to various forms of DNA damage, severe cell cycle delay, sensitivity 
to drugs 

15 that affect cell cycle progression (e.g., caffeine), sensitivity to agents that affect 
microtubule integrity (e.g., benomyl), and sensitivity to agents that affect the 
integrity of replicating DNA (e.g., hydroxyurea). 

Similarity, in Sc. pombe, inactivation of the Hhpl-\- and the 
Hhp2-\- genes to reduce or abolish the encoded protein kinase activity resulted in 

20 cellular phenotypes that mimicked hrr^S mutations. For example, deletion of the 
Hhpl+ gene resulted in a cell cycle delay and aberrant cellular morphology, in 
sensitivity to DNA damaging agents like MMS, and in sensitivity to benomyl and 
hydroxyurea. Deletion of the Hhp2'\- gene resulted in caffeine sensitivity, 
benomyl sensitivity, and hydroxyurea sensitivity, amongst other defects. 

25 The Hhpl -h gene was disrupted as follows: CDNA was subcloned 

into the 5c. pombe vector pHSS19 (Hoekstra et al, Meth. EnzymoL, 194:329, 
1991), which was digested with Nhel-EcoRI. The Sc. pombe URA4 gene was 
inserted resulting in deletion of the Hhpl-^ kinase domain. Sc. pombe was 
transformed by standard methods (Moreno, et al, Meth. EnzymoL, 194:795, 

30 1991) with the linearized DNA from the resulting plasmid construction. Stable 
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transformants were identified and haploid hhplt. strains were verified by standard 
methods (Moreno, et aL, Maniatis, et aL). 

The Hhp2+ gene was disrupted as follows: the Hhp2-f CDNA 
was cloned into the Sc. pombe based vector, plasmid pHSS19, and was disrupted 
5 by transposon shuttle mutagenesis using the mini-Tn3 transposon mTn3Leu2 
(Hoekstra, et aL, Meth, Enzymol supra.). Sc. pombe was transformed by 
standard methods with the linearized DNA from the resulting plasmid 
construction. Stable transformants were identified and haploid hhp2A strains were 
verified by standard methods (see above). 

10 Standard physiological methods as described for 5. cerevisiae 

HRR25 (Hoekstra, et aL, Science 253:1031, 1991) were employed to characterize 
hhp mutant strains. Phenotypic analysis revealed that both hhpl and hhp2 mutants 
showed defects previously seen in hrr25 mutants, including sensitivity to various 
DNA damaging treatments that include MMS treatment and X-ray treatment. 

15 The foregoing substantiates that Hhpl-^- and Hhp2+ are isoforms 

of S, cerevisiae HRR25 protein kinase. These three protein kiniases show high 
levels of sequence identity. In addition, mutations that inactivate these kinases 
result in very similar defects in widely divergent organisms. 

D. Complementation of Sc. pombe mutant strains with the 5. cerevisiae 

20 HRR25 gene . 

To show that Sc. pombe hhp mutants prepared as described above, 
were identical to S. cerevisiae hrr25 mutants and to show that fl7f7J25-like protein 
kinases with greater than 35% amino acid identity are functional homologues, the 
S. cerevisiae HKK25 gene was introduced into a Sc. pombe expression vector and 

25 transformed into Sc. pombe hhp mutants. The DNA sequence at the HRR25 
initiating methionine was changed into an Ndel site, (a silent coding alteration that 
maintains the reading frame but allows the HRR25 gene to be introduced into 
appropriate Sc. pombe plasmids). This was done by a site-directed DNA change 
was made in the S. cerevisiae HRR25 gene by standard methods using a 

30 commercially available system (Bio-Rad, Cambridge, MA). The altered HRR25 
gene was ligated into the Sc. pombe expression plasmid, pREP 1 (Maundrell, K. 
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J., BioL Chem. 265:10857, 1990), at an Ndel site and the resulting construction 
was transformed by standard methods into Sc, pombe hhp mutants. Expression 
of HKR25 in 5c. pombe mutant strains resulted in complementation of the mutant 
defects as evaluated by physiological methods described by Hoekstra, et al. 
5 (Science, supra). 

Example 7 

Isolation and Characterization of 
Yeast HRR25-\ike Genes 
Isolation of additional HRR25'likt genes from 5. cerevisiae was 

10 accomplished by performing DNA-based amplification of genomic DNA from an 
5. cerevisiae strain lacking HRR25 coding sequences [Strain 7D of DeMaggio, et 
al. (Proc. Natl Acad. Sci., USA, 89:7008-7012, 1992, incorporated herein by 
reference) thereby eliminating the chance of obtaining HKR25 sequences from the 
amplification. The primers and amplification conditions were as in Example 6. 

15 The resulting amplification products were cloned in M13mpl9 and 

sequenced by dideoxy chain termination methods. Three unique classes of 
amplified products were identified. Two of these products respectively 
corresponded to the YCK1/CKI2 and YCK2/CKI1 genes of Robinson, et al 
(Proc. Natl. Acad. Sci. USA, 89:28-32, 1992) and Wang, et al (Molecular 

20 Biology of the Cell, 3:275-286, 1992). The third gene product was designated 
"NUFl" (for Number Four). The amplified products corresponding to NUFl 
were radiolabelled as described in Example 6 and used to screen a yeast YCpSO- 
based genomic library (ATCC, Rockville, MD). Eight clones were identified and 
one of these clones included approximately 4 Kb Hindlll fragment containing the 

25 NUFl hybridizing gene. Southern analysis revealed that NUFl is a separate gene 
from ffl?R25, YCK1/CKI2, and YCK2/CKI1. The Hindlll fragment was 
sequenced and revealed a protein kinase with about 65% identity to HRR25 
through its protein kinase domain. The DNA and deduced amino acid sequences 
for NUFl are set out in SEQ. ID. NOS. 23 and 24. 

30 To further characterize the NUFl gene, the Hindlll fragment was 

subcloned into the yeast plasmid YEplacll2 [Gietz and Sugino, Gene 74:527-541 
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(1988)]. The resulting construct was transformed into the hrr25A deletion strain 
7d and NUFl was found to complement for hrr25A mitotic defects (e.g., NUFl 
complemented for slow growth defect, aberrant morphology defect, DNA 
damaging agent sensitivities). Furthermore, a null mutant allele of NUFl was 
5 constructed by transposon shuttle mutagenesis and strains lacking the NUFl gene 
product were found to have hrr25A mutant-like defects. In particular, like hrr25A 
mutants, NUFl mutants showed slower mitotic growth rates and increased 
sensitivity to DNA damaging agents like MMS, UV, and X-irradiation. 

Example 8 

10 Identification and Isolation of 

Human HRR25-like Genes 
Oligonucleotides derived from amino acid sequences described 
above in Example 6A were used to amplify cDNAS from the following sources: 
Arabidopsis thaliana, Drosophila melanogaster, Xenopus, chicken, mouse, rat, 
15 and human HeLa cells. These cDNAS were obtained from reverse transcribed 
mRNA (Maniatis, et al, , supra) or from commercially-available cDNA libraries 
(Stratagene, La JoUa, CA, and Clonetech, Palo Alto, CA) Amplification products 
of similar migration size to those obtained from 5. cerevisiae HRR25 and Sc, 
pombe, Hhpl-^- and Hhp2-^ genes were observed in 1.0% Agarose gels 
20 (Maniatis, et al, , supra). This result indicated that HRR25-hke genes exist in all 
species examined. 

Isolation of full length DNAs encoding human HRR25-like protein 
kinases was accomplished by PGR amplification of human genomic DNA using 
unique sequence oligonucleotide primers based on portions of a bovine brain 
25 casein kinase I cDNA which had been reported in Rowles, et al. (Proc, NatL 
Acad. ScL USA, £8:9548-9552, 1991) to encode a mammalian protein that was 
60% homologous to HRR25 over its catalytic domain. 

A variety of primers were prepared and used in pairwise fashion 

including: 
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(1) Primer JH21 (SEQ. ID. NO. 17) representing bovine top 
strand DNA bases 47-67; 



(2) Primer JH22 (SEQ. ID. NO. 18) representing bovine top 
strand DNA bases 223-240; 



5 (3) Primer JH29 (SEQ. ID. NO. 19) representing bovine top 

strand DNA bases 604-623; 

(4) Primer JH30 (SEQ. ID. NO. 20) representing bovine top 
strand DNA bases 623-604; and 

(5) Primer JH31 (SEQ. ID. NO. 21) representing bovine top 
10 strand DNA bases 835-817. 



DNA amplification with combination of oligonucleotides 
JH21/JH30, JH22/JH30, and JH29/JH31 were carried out for 30 cycles with 
denaturation performed at 94°C for 4 min for the first cycle and for 1 min for the 
remaining cycle annealing at 50°C for 2 min and extension at 72°C for 4 min. 

15 Products of the expected size from the three amplifications were purified on 
preparative acrylamide gels and labeled with -^^P using random nick translation 
(to a specific activity between 7 x 10^ cpm/^g and 1.4 x 10^ cpm/^g. The 
labelled probes were employed as a group to screen a commercial human fetal 
brain cDNA library (Stratagene). Hybridization was carried out for 16 hours at 

20 65°C in a hybridization buffer containing 3 x SSC, 0.1% Sarkosyl, 10 x 
Denhart's solution and 20 mM sodium phosphate (Ph 6.8). Three washes at 65 ""C 
in 2 X SSC, 0. 1 % SDS were performed. Approximately 1.5 x 10^ plaques were 
screened on 30 plates using duplicate filters. Six strong positive clones were 
isolated, purified and converted to plasmid form according to procedures 

25 recommended by the supplier of the library. Restriction digestion revealed the 
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following insert sizes for the six clones: clone 35A1, Ikb; clone 35B1, 1.4kb; 
clone 41A1, 3.7kb; clone 42A1, >4kb; clone 47A1, 3.35kb; and clone 51A1, 
2.75kb. All six inserts contained sequences which could be aligned with both the 
DNAs and deduced protein sequence of the bovine CKIa gene. The abbreviated, 
5 partial cDNA clones 35 A 1 and 35B1 were not further analyzed. Clones 41 Al and 
42A1 were identical except for size. Clones 42A1, 51A1, and 47A1 were 
redesignated as CKIalHu, CKIq:2Hu, and CKlQr3Hu, The DNA and deduced 
amino acid sequences of the inserts are set out in SEQ. ID. NOS. 7 and 8; 9 and 
10; and 11 and 12, respectively. The deduced amino acid sequence for CKIalHu 
10 was identical to the reported bovine CKIa sequence. Table 1, below sets out 
differences in nucleotides between the bovine and human DNAs, numbered from 
the first base in the initiation codon, ATG. 
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TABLE 1 

COMPARISON OF HUMAN AND BOVINE CKIa DNA 
Position Bovine Human Position Bovine Human 
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The CKIa3Hu DNA also includes an insertion of 84 bases at 
position +454 in the coding sequence providing an intermediate extension of the 
CKIa2Hu expression product by 28 amino acids. This DNA insert is not present 
in the bovine gene, but it encodes an amino acid sequence insert which Rowles, 
25 et ah designated as CKI-alpha-L. The CKIa2Hu and CKIa3Hu DNAs insertion 
at position +971 of the CKIalHu DNA. This insertion is not found in any of the 
bovine sequences and encodes an extension of the 13 amino acids adjacent the 
carboxy terminal. The last two codons of the CKIa3Hu sequences differ from 
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any of the bovine sequences or the sequences of CKIalHu and CKIa;2Hu, causing 
the CKIaSHu expression product to terminate with a lysine, rather than a 
phenylalanine as found in all the other bovine and human casein kinase I 
sequences. The 3' flanking sequence of CKIaSHu DNA differs significantly from 
5 that of CKIalHu and CKIa2Hu. 

FIGURE 2 provides an alignment of the catalytic domain amino 
acid sequences of HRR25'likc proteins whose DNAs were isolated in the above 
illustrative examples, including /Wy?2J, Hhp\-¥ , Hhpl-^ , CKIalHu, CKlQr2Hu, 
and CKIa3Hu as well as YCK1/CKI2, and YCK2/CKI1. Note that with the 

10 exception of the CKIaSHu intermediate insert and the carboxy terminal region 
inserts of CKIa2Hu and CKIaSHu, the sequences of the three human products are 
identical. "Common" residues are indicated in the Figure where at least 3 of the 
seven residues are identical at the corresponding position (the human sequences 
being taken as a single sequence). 

15 Like Hhpl-\- and Hhp2-¥, the three human HRR25'like protein 

kinases showed very high degrees of amino acid identity to the HKR25 gene 
product (68%), establishing that these human clones were enzymatic isoforms of 
the yeast HRR25 gene. The alignment of HRR25, Hhpl-\-, Hhp2'\-, and the 
human complementing-like kinase isoforms show that these enzymes share a 

20 number of primary structural features that indicate that these enzymes provide 
comparable activities in different species. This conclusion is reached based on 
several lines of evidence. First, all enzymes share the common primary sequence 
identifiers characteristic of protein kinases. Second, the enzymes share high 
degrees of amino acid identity in regions of the protein kinase domain that are not 

25 conserved in unrelated protein kinases. Finally, these enzymes share regions of 
identity in the kinase domain which regions differ in primary sequence from other 
protein kinases, but are identical among the members of this isoform grouping. 
For example, greater than 95 % of all known protein kinases have a so-called A-P- 
E sequence (Alanine-Proline-Glutamate) approximately two-thirds of the way 

30 through the kinase domain. HRR25-lik& protein kinases lack the A-P-E sequence 
and have instead a S-I/V-N sequence (Serine-Isoieucine or Valine- Asparagine). 
. Based on this primary sequence comparison, between known protein kinases and 
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the protein kinases of the invention from evolutionarily divergent organisms, these 
enzymes of the invention are isoforms of HRR25 protein kinase. 

Example 9 

Comparison of HRR25 with a Casein Kinase 
5 In all eukaryotes examined, two of the major protein kinases are 

casein kinase I and II (CKI and CKII, respectively). These enzymes have been 
found in all cell types and species examined. Both enzymes recognize Ser/Thr 
residues in an acidic environment in the substrate. These two protein kinases are 
found throughout the cell and their activities have been purified from or found to 

10 be associated with cytoplasmic fractions, membranes, nuclei, mitochondria, and 
cytoskeleton. CKII is predominantly a nuclear enzyme, but similar studies have 
yet to be described for CKI. 

To determine whether HER25 gene product might function as a 
casein kinase, the ability of HR7J25-containing immunoprecipitates to 

15 phosphorylate casein was studied. H?f7?25-containing immunoprecipitates from 
yeast were incubated with casein and phosphorylated proteins were examined. 

Yeast extracts were prepared by physical disruption. Equal 
volumes of a cells were suspended in lysis buffer and acid-washed 0,5 mm beads 
were mixed, 30 second bursts were interspersed with 1 min on ice, and the extent 

20 of disruption was followed microscopically. Lysis buffer contained 10 Mm 
sodium phosphate (Ph 7.2), 150 Mm NaCl, 1% Nonidet P-40, 1% Trasylol, 1 
Mm DTT, 1 Mm benzamidine, 1 Mm phenylmethyl sulfonyl fluoride, 5 Mm 
EDTA, pepstatin (1 ug/ml), Pepstatin A (2 ug/ml), leupeptin (1 ug/ml), lOOmM 
sodium vanadate, and 50 Mm NaF. Extracts were clarified by a 100,000 x g 

25 centrifugation for 30 min., made to 50% (vol/vol) with glycerol, frozen in liquid 
nitrogen, and stored at -70 degrees C. Little loss in protein kinase activity was 
seen in frozen extracts over several months. 

Immune complex protein kinase assays were performed on the 
extracts according to the methods described in Lindberg, et al (Mol.CelL Biol. 

30 10:6316, 1991). Frozen extracts were diluted to 25% glycerol with lysis buffer 
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or fresh extracts were used directly. Extracts were precleared with preimmune 
serum and protein A-Sepharose, and then treated with immune serum (obtained 
as described in Example 11, infra, from immunization of rabbits with E. coli- 
derived \yp^-HRR25 fusion products), HRR25 kinase-containing immune 
5 complexes were precipitated with protein A-Sepharose. Immune complexes were 
washed four times with lysis buffer and twice with kinase buffer containing 15 
Mm Hepes (Ph 7.4), 100 Mm NaCl, and 10 Mm MgCl2. 

Reaction mixtures of HKR25 immunoprecipitates and heat-treated 
casein (300 ng/20ul reaction volume) were incubated at 30 degrees C for 5-10 min 

10 and contained 10 uCi of gamma--^^P-ATP per 20 ul reaction volume. Reactions 
were stopped by the addition of SDS and EDTA, boiled in SDS/PAGE sample 
buffer and resolved in 10 % gels. Phosphoamino acid analysis was as described 
(Hunter et al, Proc. Natl Acad. SciMSA 77:1311, 1980). 

Immunoprecipitates from HRR+ strains were able to phosphorylate 

15 casein. To verify that the appropriate amino acids were phosphorylated, the 
phosphoamino acid composition of the ffifR25-phosphorylated casein was 
examined by phosphoamino acid analysis. Samples were resolved by two- 
dimensional electrophoresis at Ph 1.9 and Ph 3.5. Consistent with mammalian 
CKI specificity, serine and threonine residues were phosphorylated. HRR25 

20 phosphorylated serine residues on casein 3-fold greater than threonine residues. 
Similarly, the autophosphorylation of HRR25 in immune complexes in vitro 
occurred on serine and threonine residues. Coupled with the high degree of 
sequence identity, these results suggest that HKR25 might be a CKI isoform. 

To extend and confirm that HRR25 immunoprecipitates from yeast 

25 could phosphorylate casein, several experiments were performed. HKR25 
immunoprecipitated from E. coli strains expressing HKR25 (See Example 1 1) also 
showed casein kinase activity, whereas £. coli extracts lacking HRR25 protein did 
not phosphorylate casein. //i?/f25-containing baculovirus constructs produced 
casein kinase activity in immunoprecipitates. Wild-type baculovirus-infected cells 

30 showed (0.5% casein kinase activity under comparable conditions. The protein 
kinase activity from S19 cells expressing HKR25 protein was sensitive to the same 
conditions that reduced or inactivated the HRR25 protein activity from yeast 
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extracts. The observations that Hiy?25-dependent casein kinase activity was 
present in immunoprecipitates from E, coli cells expressing wild-type HRR25, in 
insect cells infected with ///f/J25-containing baculovirus, and in wild-type but not 
hrr25A mutants indicated that the HKR25 gene product could function as a casein 
5 kinase and that the casein kinase activity in HRR25 protein-containing 
immunoprecipitates was due to HRR25 gene product. 

Example 10 

Analysis of Protein Kinase 
Activity of HRR25-\\\^^ Proteins 

10 Because the predominant protein kinase activity in E. coli is 

histidine kinase, rather than serine/threonine or tyrosine kinase, those procaryotic 
cells provide a system for examination of HRR25Ai\i^ protein kinase activities 
which is not compromised by presence of endogenous kinases. Both HRR25 and 
Hhpl + DNAs were, therefore, expressed in the IPTG-inducible T7 gene 10-based 

15 commercial expression system (Invitrogen, San Diego, CA) using £. coli strain 
BL21 (DE3) which contains an IPTG-inducible T7 RNA polymerase and T7 
lysozyme gene. See, DeMaggio, et al, Proc. Natl Acad. Sci, USA, 89:7008- 
7012, (1991). In a first series of experiments, E. coli lysates were prepared by 
inducing mid-log phase cells with IPTG for 2 hours, pelleting the cells, and 

20 preparing extracts by a freeze-thaw method using buffers described in DeMaggio, 
et al, supra. Extracts were electrophoresed in polyacrylamide gels, transferred 
to nylon-based support membranes, and probed by Western analysis with 
antibodies directed against phosphotyrosine (UBI, Lake Placid, NY). These 
procedures revealed that HRR25 and Hhpl+ expressing cells contained novel 

25 tyrosine phosphorylated proteins not observed in control cells (transformed with 
the vector alone or with kinase inactive mutants). In a second experiment, the 
HRR25 and ///zp7-f -containing £. coli strains were examined for tyrosine- 
phosphorylated protein by a sensitive and accurate radiolabelling and 
phosphoamino acid procedure. To do this experiment, cells were induced with 

30 IPTG and grown in the presence of ^^P-orthophosphate. Radiolabelled extracts 
were prepared by the freeze-thaw method, electrophoresed in polyacrylamide gels, 
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and the gels were examined by autoradiographic methods. Novel phosphoproteins 
were observed in the strains expressing HRR25 and Hhpl + , but not in the above 
controls. Phosphoproteins were examined by extracting and hydrolyzing the 
proteins from the gels using standard methods (Boyle, et aL, Meth. Enzymol, 
5 201:110, 1991). These experiments verified that HRR25 and Hhpl+ could 
phosphorylate tyrosine, serine, and threonine residues on protein substrates. 

Example 11 

Recombinant Expression of HRR25 
Products and Generation of Antibodies Thereto 

10 Two different plasmid constructions were developed for expression 

of HRR25 DNA in £. coli to generate immunogens useful in preparation of anti- 
HRR25 antibodies. 

The first plasmid construction involved plasmid pATH according 
to Koemer et ah , Meth. EnzymoL , 194:477-491 (1991). An approximately [606] 

15 base pair DNA fragment was isolated from the HRR25 open reading frame by Bgl 
II digestion and this fragment (which encodes amino acid residues 275-476) was 
ligated into pATH which had been digested with BamlU. The resulting plasmid 
encoded a fusion protein comprising the E, coli TrpE gene product at its amino 
terminus and a carboxy terminal fragment of HRR25 at its carboxyl terminus. 

20 Inclusion bodies were isolated from £. coli DHSa (Bethesda 

Research Laboratories, Bethesda, MD) host cells transformed the plasmid using 
lysis buffers as described in Koemer et aL, supra, and were purified by 
polyacrylamide gel electrophoresis. The gel purified materials were then 
employed in the immunization of rabbits by subcutaneous injection as 

25 recommended by Harlow, et al. , Antibodies: A Laboratory Manual , Cold Spring 
Harbor Laboratory, Cold Spring Harbor, NY (1988), using gel purified products 
with complete Freund's adjuvant for primary injections and incomplete Freund's 
adjuvant for subsequent injections. Serum reactivity was followed by Western 
blotting against the gel purified antigen. Affinity purification of serum antibodies 

30 was effected using the E, coZ/-produced antigen immobilized on a nitrocellulose 
membrane support. 
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Example 12 
Isolation of CKl7lHu and CKI72HU 
Additional human HRR25-like protein kinase encoding DNAs were 
isolated by combined DNA amplification and library screening methods. 
5 Oligonucleotides based on conserved regions in HRR25-like protein kinases were 
used to ampli:fy DNA segments for use as probes in screening human a cDNA 
library. Redundant oligonucleotides of the sequence 

5 ' GAR Yn MGI YTI GGI AAY YTI TA-3 ' (SEQ ID NO. 28) 

and 

10 5 -GTY TTR TTI CCI GGI CKI CCI AT-3 ' (SEQ ID NO. 29) 

(where G, A, T, and C = standard nucleotides and R = A and G; Y = C and 
T; I = Inosine; M = A and C; and K = G and T) were used to amplify an 
approximately 540 nucleotide from a human fetal brain cDNA library (Clonetech). 
Amplification conditions used 200 Mm Tris.Hcl (Ph 8.2), 100 Mm KCl, 60 Mm 

15 (NH4)2S04, 15 Mm MgC12, 1% Triton X-100, 0.5 /xM of each primer, 100 ng 
library DNA template, 200 ^M dNTPs and 2.5 U polymerase. The reactions 
were performed for 30 cycles. Reactions were started with a 4 minute treatment 
at 94^C and all cycles were 1 minute at 94^ C, 2 minutes at 5*^0 for annealing, 
and 4 minutes at 72 "^C for extension. 

20 The amplification reaction was electrophoresed through a 1% 

agarose gel and the region corresponding to approximately 540 base pairs was 
excised and DNA was eluted using a Nal extraction and glass powder binding 
(GeneClean, BiolOl, La JoUa, CA). The gel-purified fragment was ligated into 
Smal-digested Bluescript II SK(+) and the resulting plasmid contained a partial 

25 protein kinase domain that was used as a source of cDNA for library screening. 
Ten micrograms of this plasmid was digested with EcdRl and BamHl to liberate 
the subcloned fragment and the reaction was electrophoresed through a 1% 
agarose gel. The approximately 540 nucleotide fragment was eluted from the gel 
and was radiolabelled by random primed oligonucleotide directed labelling 

30 (Amersham, Arlington Heights, IL) using -^^P-dCTP as the radioactive nucleotide. 
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The radioactive probe was used to screen a human Manca B cell lymphoma 
library [Wiman, et al. , Proc.Natl.Acad.ScL (USA) 87:6798-6802 (1984)] prepared 
in phage cloning vector XgtlO prepared as follows. Poly d(A)'^RNA 

was prepared from 2.8 x 10^ cells of the B-cell lymphoma Manca using the "Fast 
5 Track" kit (Invitrogen). 5 ftg of RNA was used for oligo d(T) primed cDNA 
synthesis with the cDNA Synthesis System (Gibco BRL, Burlington, Ontario, 
Canada); the resulting cDNA was size selected by agarose gel electrophoresis and 
ligated to EcdKl adapters with the Ribo Clone kit (Promega, Madison, WI). 
Varying amounts of the adapted cDNA were ligated to J5:coRI-digested XgtlO with 

10 1 unit of T4 DNA ligase (Boehringer Mannheim, Indianapolis, IN) in a 
commercially prepared buffer supplied by the manufacturer with the enzyme. The 
ligations were packaged with Gigapack packaging extracts (Stratagene) and the 
resulting phage pool (1.5 x 10^ phage) was amplified in the C600 Hfl strain. A 
total of 1 X 10^ phage plaques were screened by standard hybridization methods 

15 (Maniatis, et aL, supra). Hybridizations were at 65 ""C for 18 hours in 6X SSPE 
(20X SSPE is 175.3 g/1 NaCl, 27.6 g/1 NaH2P04.H20), 7.4 g/1 EDTA, pH 7.4), 
100 Mg/ml salmon sperm carrier DNA, 5X Denhardt Reagent (50X Denhardts is 
5% ficoll, 5% polyvinyl pyrolidone, 5% bovine serum albumin), 0.1% SDS and 
5% sodium dextran sulfate. Filters were washed four times in O.IX SSPE, 1% 

20 SDS. Each wash was at 65 °C for 30 minutes. Five clones were chosen for 
further analysis. 

DNA from these phage clones was prepared using a Qiagen lambda 
DNA preparation kit (Qiagen, Chatsworth, CA) and human cDNA inserts were 
excised by EcoKL digestion. These inserts were subcloned into ficoRI-digested 

25 plasmid Bluescript II SK(-h) (Stratagene) and the inserts were sequenced using an 
ABI 373A automated DNA sequencer. Two of the five cDNA contained near 
full-length cDNAS with a polyA tail and a protein kinase open reading frame. 
These protein kinases were most closely related to isoforms of casein kinase I 
were designated CKIylHu and CKI72HU. The DNA sequences of CKI7IHU and 

30 CKI72HU are set out in SEQ ID NOS: 30 and 32, respectively; the deduced 
amino acid sequences of CKI7IHU and CKI72HU are set out in SEQ ID NOS: 31 
and 33, respectively. 
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Example 13 
Isolation of CKI3Hu 
Human CKI6 was subcloned by first isolating the human gene from 
a human fetal brain library constructed in XZAPII (Stratagene). A 2.2 kb EcoRI 
5 fragment containing rat CKI5 was gel purified through 1 % agarose, isolated from 
the gel by Nal extraction with glass powder (BiolOl, La JoUa, CA), and 
radiolabelled by random primer methods (Boehringer Mannheim) using •'^P-dCTP. 
This probe was used to screen 1 X 10^ plaques containing human fetal brain 
cDNA library. Plaque hybridization conditions were 3X SSC, 0.1% Sarkosyl, 
10 lOX Denhardts reagent, 50 ftg/ml salmon sperm DNA carrier. Hybridization was 
allowed to proceed for 18 hours at eS'^C after which time the filters were washed 
4 times for 30 minutes each at 65X in 2X SSC, 1.0% SDS. Positive clones 
were identified by autoradiography at -70"* C with an enhancing screen and 
sequenced using an automated ABI373A DNA sequencer (Applied Biosystems, 
15 Foster City, California),. 

One clone was determined to encode a full length CKI5 isoform and 
was designated CKI5Hu, The nucleotide sequence for CKISHu is set out in SEQ 
ID NO: 34, and the deduced amino acid sequence is set out in SEQ ID NO: 35. 

Expression of the CKI5Hu isoform was then determined in eight 
20 different human tissues using an approximately 1,2 kb EcoRI fragment as a probe. 
, CKISHu mRNA levels were highest in kidney, liver and placenta cells, in contrast 
to the testes-specific expression of rat CKI5 demonstrated by Graves, et 
aL, [supra]. 
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Table 2 - Sequence Homology Between CKI Isoforms 
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Human 


Human 


Human 
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50 


50 
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Human CKIal 




100 


52 


52 


76 


Human CKI7I 






100 


99 


55 


Human CKI72 








100 


55 


Human CKI5 










100 



Example 14 

Complementation of Yeast CKI Mutants 
by Human CKI Genes 

10 In order to determine if CKI7IHU was an isoform of yeast HRR25- 

like protein the gene was expressed in yeast protein kinase mutants. The cDNA 
was expressed under control of the yeast GALl promoter. The expression 
plasmid was a derivative of plasmid pRS305 (Stratagene) that contains the yeast 
GALl promoter. The parental plasmid with the GALl promoter was previously 

15 described [Davis et ah. Cell 5i:965-978 (1990)] and contained a BglO. site 
adjacent to the GALl promoter as well as BamHI and Sac\ sites adjacent to the 
BgEL site. This plasmid was modified by site-directed mutagenesis to contain a 
unique Nco\ site between the GALl promoter and the Bglll site. The Nco\ site 
was adjacent to the GALl promoter such that the order of genetic elements was 

20 GALl promoter-M:t?I-5^/II-BflmHI-5flcI. Site-directed mutagenesis (MutaGene 
kit, BioRad) employed the oligonucleotide 

5 -CTA GAT CTA GCT AGA CCA TGG TAG TTT TTT CTC CTT GAC 
G-3 ' (SEQ ID NO. 36) 
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and generated a unique Ncol site (underlined in SEQ ID NO: 36). The resulting 
plasmid was called pRS305(N) 2/x GALl. 

To clone CKI7IHU into pRS305(N) 2^ GALl , the CKI7IHU cDNA 
was amplified from cDNA with oligonucleotides that would introduce an Ncol site 
5 at the initiating ATG and a BamRl site in the 3' untranslated region. The 
sequence of the mutagenic oligonucleotide (with the Ncol site underlined) for the 
amino terminus was 

5 '-CAT G CC ATG O CA CGA CCT AGT-3 ' (SEQ ID NO: 37). 

The oligonucleotide M13rev, purchased from Stratagene (Stratagene, La Jolla, 

10 CA) was used to introduce the BamRl site in the 3' untranslated region. 
Amplification conditions used 200 Mm Tris-HCl (Ph 8.2), 100 Mm KCl, 60 mM 
(NH4)2S04, 15 mM MgCl^, 1% Triton X-100, 0.5 ^M of each primer, 100 ng 
template, 200 ^M of each dNTP and 2.5 units polymerase. The reactions were 
performed for 30 cycles. Reactions were started with a 4 minute treatment at 

15 94 ''C and all cycles were 1 minute at 94 ""C for denaturing, 2 minutes at SO^'C for 
annealing, and 4 minutes at 72 ''C for extension. The amplified product was 
digested with Ncol and BamUl and was cloned into NcollBarnRl-digcsXcA 
pRS305(N) 2/x GALL 

Complementation of yeast CKI mutants employed yeast strains 7D 

20 {hrr 25 A, uraS-l, trpl-L leu2-3, 112, his3-llJ5, canl-lOO, ade2-l) 
[DeMaggio, et al , (1992) supra] and YI227 {ckilD, cki2D, FOA^, ade2-l, canl- 
100, his3'llJ5, Ieu2-3J2, trpl-L ura3-l, pRS415::Ckilts) Strain 7D lacked the 
HRR25 isoform of yeast CKI and strain YI227 contained a temperature sensitive 
allele of yeast CKIl. Yeast strains were transformed by lithium acetate-mediated 

25 transformation methods and transformants were selected on SD-leucine medium 
(BiolOl). Controls for transformation were plasmids pRS305(N) 2^g GALl 
alone, plasmid pRS315 (Stratagene), and plasmid pRS315::HRR25, which 
contains a Sall-EcoRI genomic fragment that spans the genomic HRR25 fragment 
[Hoekstra et al. Science, supra]. Plasmid pRS315::HRR25 was constructed by 

30 ligating a SaWEcdBl genomic fragment of HRR25 into Safl/EcoRI-digested 
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pRS315. Both HRR25 and CKIylHu, when expressed in yeast mutants, are 
capable of fully complementing for the temperature-sensitive growth defect of 
CKI. In addition, CKI7IHU partially suppressed a severe growth rate defect 
associated with HRR25 mutants. The partial suppression of HRR25 growth 
5 defects by CKI7IHU was detected by a 10-20 fold greater plating efficiency 
relative to pRS305(N) Ifi GALl. 

To extend the complementation analysis to additional CKI family 
members, the ability of other human CKIaHu and CKISHu genes to complement 
for the HRR25 mutant defects was examined. Human CKIalHu was subcloned 
10 into plasmid pRS305(N) 2fi GALl by first introducing an Ncol site at the 
initiating methionine by site-directed mutagenesis. The mutagenic oligonucleotide 
(with the Ncol site underlined) was 

5 -CTA GAT CTA GCT AGA CCA TGG TAG TTT TIT CTC CTT GAC 
G-3' 

15 (SEQ ID NO. 38) 

and mutagenesis was performed using the Mutagene kit (BioRad). The 
mutagenized cDNA was digested with Ncol and BgRl and the CKIalHu fragment 
was ligated into pRS305(n) Ifi GALl. 

Two constructs containing the CKISHu cDNA were examined for 
20 complementation. Plasmid pECTB (containing CKISHu cDNA) was used as a 
template for site-directed mutagenesis (MutaGene, BioRad). The mutagenic 
oligonucleotide 

5 -GAA TCG GGC CGC CG A GAT CT C ATA TGG AGC TGA GAG TC-3 ' 
(SEQ ID NO: 39) 

25 was used to introduce Bglll (underlined in SEQ ID NO; 39) and Ndel (in italics 
in SEQ ID NO: 39) sites at the initiating ATG of CKISHu. One plasmid 
construction employed BgHl/Sacl-digcsied CKI DNA from the mutagenized cDNA 
that was ligated into Bglll/ Sacl-digcsicd pRS305(N) 2fi GALl to produce 
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pRS305(CKI6). The second plasmid construct employed Ncol/ Sacl-digestcd 
CKISHu cDNA from unmutagenized pEC7B cDNA that was ligated into 
MroI/5flcI-digested pRS305(N) Ifi GALl to produce pRS305(N)(CKI6). Plasmid 
pRS305(CK[5) contained the nucleotides 

5 5 '-CCC GGA TCT AGC AGA TCT CAT-3 ' (SEQ ID NO: 40) 



between the GALl promoter and the initiating methionine of CKI5. Plasmid 
pRS305(N)(CKI5) had a near-perfect fusion between the initiating methionine of 
CKISHu and the 3' end of GALl . Near perfect fusion indicates that the promoter 
and initiating methionine codon have few or no intervening nucleic acid 

10 sequences, and therefore are approximately abutting. 

The CKIoflHu and CKISHu-containing plasmids were transformed 
into yeast strains 7D and YI227 and were examined for their ability to 
complement for their mutant defects. Like CKI^Hu, CKIalHu partially 
complemented the growth defect associated with HRR25 mutations. CKISHu was 

15 able to complement for the growth defect of temperature-conditional CKI strains, 
for the growth defect of HRR25 mutants, and for the DNA repair defect of 
HRR25. The ability of CKISHu to complement for mutant defects in these yeast 
strains was indistinguishable from yeast HRR25 or CKI genes only when the 
appropriate plasmid construct was employed. Plasmid pRS305(CKI5), which 

20 contained the additional 21 bases was unable to complement for any mutant 
phenotypes, while the near-perfect fusion in pRS305(N)(CKIS) was fully 
functional. This difference was attributed to the inability of yeast to translate 
extended and/or CG rich leader sequences. 
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Example 15 
Generation of Monoclonal Antibodies 
A. CKIaHu Peptides 

Monoclonal antibodies were raised against the following peptides. 
5 SEQ ID NO: 41 was derived from the common amino terminus of CKIalHu, 
CKIq£2Hu, and CKIaSHu, and SEQ ID NO: 42 was derived from an internal 
alternative splice region in CKIaSHu. 

NH2-ASSSGSKAEFIVGGY-COOH (SEQ ID NO: 41) 
NH2-RSMTVSTSQDPSFSGY-COOH (SEQ ID NO: 42) 

10 These peptides were initially each coupled to bovine gamma globulin (Sigma, St 
Louis, MO). Five mg of gamma globulin and 5 mg of peptide were resuspended 
in 0.4 ml 100 mM K2HPO4 (pH 7.2) and to this mixture, 35 mg l-ethyl-3(3- 
dimethylamino propyl)-carbodiimide-HCl (EDC, Pierce) previously dissolved in 
50 /xl K2HPO4 (pH 7.2) was added. The reaction was allowed to proceed for 16 

15 hr at 4°C and was quenched by addition of 0.25 ml 2 M ethanolamine and 0.25 
ml acetic acid. The reaction mixture was then diluted to a final volume of 2.5 ml 
with PBS and desalted using Sephadex G-25M (Pharmacia) chromatography. 
Protein containing fractions were concentrated by centrifugal microconcentration 
(Amicon). Mice were then injected with 50 fig of the coupled peptide nine times 

20 over a period of 8 months. Antibody production was measured against the 
respective peptides by ELISA. 

Fusions were performed by standard methods. Briefly, a single-cell 
suspension was formed by grinding the spleen between the frosted ends of two 
glass microscope slides submerged in serum free RPMI 1640 media, 

25 supplemented with 2 mM L-glutamine, 1 mM sodium pyruvate, 100 units/ ml 
penicillin, and 100 /xg/ml streptomycin (RPMI) (Gibco), The cell suspension was 
filtered through a sterile 70-mesh Nitex cell strainer (Becton Dickinson, 
Parsippany, NJ), and washed twice by centrifuging at 200 g for 5 minutes and the 
pellet resuspended in 20 ml serum free RPMI. Thymocytes taken from 3 naive 

30 Balb/c mice were prepared in a similar manner. 
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NS-1 myeloma cells, kept in log phase in RPMI with 11% fetal 
bovine serum (FBS) (Hyclone, Laboratories, Inc., Logan, Utah) for three days 
prior to fusion, were centrifuged at 200 g for 5 minutes, and the pellet was 
washed twice as described in the foregoing paragraph. After washing, each cell 
5 suspension was brought to a final volume of 10 ml in serum free RPMI, and 10 
^1 was diluted 1:100. From each dilution, 20 fil was removed, mixed with 20 /xl 
0,4% trypan blue stain in 0.85% saline (Gibco), loaded onto a hemacytometer 
(Baxter Healthcare Corp., Deerfield, IL) and cells counted. 

Approximately 2 x 10^ spleen cells were combined with 4 x 10^ 

10 NS-1 cells, centrifuged, and the supernatant was aspirated. The cell pellet was 
dislodged by tapping the tube and 2 ml of 37**C PEG 1500 (50% in 75 mM 
Hepes, Ph 8.0) (Boehringer Mannheim) was added with stirring over the course 
of 1 minute, followed by adding 14 ml of serum free RPMI over 7 minutes. An 
additional 16 ml RPMI was added and the cells were centrifuged at 2O0 g for 10 

15 minutes. After discarding the supernatant, the pellet was resuspended in 200 ml 
RPMI containing 15% FBS, 100 fiM sodium hypoxanthine, 0.4 ^M aminopterin, 
16 fiM thymidine (HAT) (Gibco), 25 units/ml IL-6 (Mallinckrodt, St. Louis, MO) 
and 1.5 X 10^ thymocytes/ml. The suspension was dispensed into ten 96-well flat 
bottom tissue culture plates (Coming, Essex, United Kingdom) at 200 ptl/well. 

20 Cells in the plates were fed 2-3 times between fusing and screening by aspirating 
approximately half of the medium from each well with an 18 gauge needle 
(Becton Dickinson), and replenishing plating medium described above except 
containing 10 units/ml IL-6 and lacking thymocytes. 

Fusions were screened when cell growth reached 60-80% 

25 confluency (usually 7-9 days). Fusion 75 was screened by ELISA on either die 
common amino terminal peptide (SEQ ID NO: 41) or the internal peptide (SEQ 
ID NO: 42), and fusion 80 was screened on the amino terminal peptide (SEQ ID 
NO: 41) only. Immulon 4 plates (Dynatech, Cambridge, MA) were coated at 
4**C overnight with 100 ng/well peptide in 50 mM carbonate buffer, Ph 9.6. 

30 Plates were washed three times with PBS containing 0,05 % Tween 20 (PBST) and 
50 /il culture supernatant was added. After incubation at 37''C for 30 minutes, 
and washing as above, 50 ftl horseradish peroxidase conjugated goat anti-mouse 
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IgG(fc) (Jackson ImmunoResearch, West Grove, PA) diluted 1:3500 in PBST was 
added. Plates were incubated as above, washed four times with PBST and 100 
fil substrate, consisting of 1 mg/ml o-phenylene diamine (Sigma) and 0.1 /xl/ml 
30% H2O2 in 100 mM citrate, pH 4.5, was added. The color reaction was 
5 stopped in 5 minutes with the addition of 50 /xl of 15% H2SO4. Absorbance at 
490 nm was read on a plate reader (Dynatech). 

Three wells from each fusion (designated 75D3G, 75C10H, 75C2g, 
80G10H, 80H4F, and 80J9E) were cloned two to three times, successively, by 
doubling dilution in RPMI, 15% FBS, lOOptM sodium hypoxanthine, 16 ^tM 
10 thymidine and 10 units/ ml IL-6. Wells of clone plates were scored visually after 
4 days and the number of colonies in the least dense wells were recorded. 
Selected wells of each cloning were tested by ELISA as above. In the final 
cloning, positive wells containing single colonies were expanded in RPMI with 
11% FBS. 

15 Three antibodies were determined to be reactive for the peptide 

raised against the amino terminus of CKIaHu (80 GlOHllD, 80 H12F12B, and 
80 J9E10C), and three antibodies were reactive with the peptide raised against the 
internal fragment of CKIa3Hu (75 D3G10A, 75 CIOHID, and 75 C2G11F). 
Clones 75D3G, 75C10H, 75C2G, and 80G10H were isotyped to be IgGl, clone 

20 80H4F IgG3, and 80J9E IgG2a. 

B. CKIHu/Thioredoxin Fusion Proteins 

Expression plasmids were constructed in order to express the 
CKIHu isoforms as fusion proteins with thioredoxin. Specifically, the coding 
sequence for each isoform was amplified by PGR with primers which created a 
25 5 Xbdl restriction site and a 3 'BamHl site. The primer used to create the Xbal 
site for the CKlaHu isoforms is set out in SEQ ID NO: 43 with the Xbal site 
underlined. 



5 -T ACA TCT AGA ATT ATG GCG AGT AGC AGC GGC-3 ' 
(SEQ ID NO: 43) 
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The primer used to create the 3 'BaniHl site in the CKIalHu coding sequence is 
set out in SEQ ID NO: 44, with BamKl site underlined. 

5 '-AAT GGA TCC TTA GAA ACC TGT GGG GGT-3 ' 
(SEQ ID NO: 44) 

5 The primer used to create the BamUl site in the CKIa2Hu and CKIaSHu coding 
sequences is set out in SEQ ID NO: 45, with the BamHl site underlined. 

5 '-AAT GGA TCC TTA GAA ACC TTT CAT GTT ACT CTT GGT-3 ' 
(SEQ ID NO: 45) 

The Xbal and BamHl sites were created in the CKISHu coding sequences with 
10 primers set out in SEQ ID NOS: 46 and 47, respectively. 

5 '-T ACA TCT AGA ATT ATG GAG CTG AGA GTC GGG-5 ' 
(SEQ ID NO: 46) 

5 - GGA TCC TCA TCG GTG CAC GAC AGA CTG-3 ' 
(SEQ ID NO:47) 

15 The primers used to create the Xbal and BamHl sites in the coding regions of the 
CKI7HU isoforms are set out in SEQ ID NO: 48 and 49. 

5 'T ACA TCT AGA ATT ATG GCA CGA CCT AGT GGT CGA TCG-3 ' 
(SEQ ID NO: 48) 

5 -G G GG ATC CT A CTT CAG TAG GGG CTG-3 ' 
20 (SEQ ID NO: 49) 

Digestion of the resulting PCR products with Xbal and BamHl allowed the 
fragments to be directionly cloned in frame at the carboxy terminus of sequences 
encoding thioredoxin in plasmid pTRXFUS [LeVallie, et al., 
Nature/Biotechnology ii: 187-193 (1993)]. The resulting expression constructions 
25 contained the laq Iq gene, followed by the tacll promoter (from plasmid pMal-c2, 
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New England Biolabs, Beverly, MA) which drives expression of the E.coli 
thioredoxin gene fused at the amino termini of the CKl catalytic domains. 

E.coli XL-1 Blue cells (Stratagene) were transformed with the 
individual expression plasmids by standard methods and grown at SV^C to mid-log 
5 phase. Samples were collected to serve as controls for uninduced cells and the 
remaining cells were induced for four hours with 0.25 mM IPTG at 37*^0. Cells 
were then lysed and inclusion bodies in the insoluble extract from cleared lysate 
were used to inject mice. 



C. Other CKI Peptides 

10 Monoclonal antibodies were also raised against other CKI peptides 

coupled to bovine gamma globulin as in section A of this example. Peptides 
derived from the amino termini of the CKI7HU isoforms are set out in SEQ ID 
NOS: 50 and 51; peptides derived from the amino termini of bovine CKIiS 
[Rowles, et aL, supra] are set out in SEQ ID NOS: 52 and 53; peptides derived 

15 from the amino terminus and carboxy terminus of CKISHu are set out in SEQ ID 
NOS: 54 and 55, respectively; a peptide derived from the carboxy termini of 
CKIq:2Hu and CKlQr3Hu is set out in SEQ ID NO: 56; and a peptide common to 
all CKIHu isoforms is set out in SEQ ID NO: 57. The common CKI sequence 
set out in SEQ ID NO: 57 was also injected into rabbits to produce polyclonal 

20 antisera. 



NH2-RSGHNTRGTGSS-COOH (SEQ ID NO: 50) 

NH2-RLGHNTRGTGSS-COOH (SEQ ID NO: 51) 

NH2-SSRPKTDVLVG-COOH (SEQ ID NO: 52) 
NH2-KSDNTKSEMKHS-COOH (SEQ ID NO: 53) 

25 NH2-GTDIAAGE-COOH (SEQ ID NO: 54) 

NH2-ERRDREERLR-COOH (SEQ ID NO: 55) 
NH2-TGKQTDKTKSNMKGY-COOH (SEQ ID NO: 56) 
NH2-DLLGPSLEDLFGY-COOH (SEQ ID NO: 57) 
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Mice were injected with 50 fig of the peptide/gamma globulin complex on a 
varying schedule over a period of eight months. 

Subsequent to the filing of U.S. Patent Application Serial No, 
07/728,783 on July 3, 1991, there have been numerous reports in the scientific 

5 literature of the isolation of DNAs encoding HRR25-l\ke proteins. For example, 
Rowles, et al (Proc. Natl Acad. ScL USA, 88:9548-9592, 1991) reported the 
purification of a bovine thymus casein kinase I (CKI) enzyme. The sequencing 
of tryptic fragments reveled nearly 25 % of the primary sequence of the enzyme. 
PGR cloning resulted in development of partial clones coding for the CKI enzyme 

10 isolate and a homologue enzyme referred to as CKI-5. Screening of bovine brain 
libraries with the partial clones yielded full length cDNAs for the CKI isolate 
(designated CKIof) and two additional homologues (CKI)3 and CKI7). The 
deduced sequence for bovine CKIa was noted by Rowles, et al., [supra] to be 
60% homologous to HRK25 over its catalytic domain. As noted earlier, a 

15 comparison of the bovine CKIar sequence of Rowles, et aL to human CKIal 
sequence set out in SEQ. ID. NO. 7 and 8 reveals 100% homology in the 
catalytic domain. 

As another example, Robinson, et aL (Proc. Natl Acad, ScL USA, 
89:28-32, 1992) describes the isolation of two Saccharomyces cerevisiae genes, 

20 YCKl and YCK2 which encode yeast casein kinase 1 homologues and also 
describes purification and partial sequencing of a rabbit casein kinase I from a 
rabbit reticulocyte lysate preparation. HRR25 was noted to be 50% homologous 
to YCKl and YCK2 and 60% homologous to the partial rabbit CKI sequence. 
As a further example, Wang, et aL (Molecular Biology of the Cell, 3:275-286, 

25 1992) describes the isolation of a 54 kDa CKI from S. cerevisiae and the use of 
amino acid sequence information therefrom for cloning two yeast cDNAs 
encoding homologous casein kinase I proteins, CKIl and CKI2. Comparison of 
the catalytic domains of the protein encoded by the CKIl gene produced few 
alignments revealing greater than 20-25% homology. The closest matches were 

30 with HRR25 (50-56%) and with the three bovine isozymes of Rowles, et aL (51- 
56%). The YCKl sequence of Robinson, et aL corresponds to the CKI2 
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sequence of Wang, et aL ; the YCK2 sequence corresponds to CKIl. Brockman, 
€t al (Proc. Natl Acad, Sci. USA, 89:9454-9458, 1992) reported the 
immunopurification and sequencing of a human erythroid casein kinase I and 
noted that it was 62% homologous to HRR25. As a final example, Graves, et aL 
5 (J.BioLChem. 265:6394-6401, 1993) reported the cloning and characterization of 
a casein kinase I from rat testes. This CKI, designated CKI8, shared 76% 
homology at the amino acid level with CKIa isolated from bovine brain and 65% 
homology with HRR25. 

While the foregoing illustrative examples are specifically directed 

10 to isolation of "full length" polynucleotides encoding the HRR25Axke proteins 
HRR25, Hhpl-h, Hhp2-h, CKIalHu, CKIa2Hu, CKIa3Hu, CKISHu, CKIylHu 
and CKI72HU, it will be readily understood that the present invention is not 
limited to those polynucleotides. Rather it embraces all polynucleotides which are 
comprehended within the class of genes encoding flR/f25-like proteins 

15 characterized protein kinase activity and by homology of 35% or more with the 
HRR25 protein through the protein kinase catalytic domain. By way of example, 
employing information concerning the DNA sequence of HRR25^ the procedures 
of Example 7 allowed the isolation partial cDNA clones of expected length from 
cDNA libraries derived from Arabidopsis thaliana, Drosophila melanogaster, 

20 Xenopus, chicken, mouse, rat, and human species. These partial cDNAs may, 
in turn, be employed in the manner of Examples 6 and 7 to isolate full length 
DNA clones encoding HRR25 -like proteins from these species. Each of these 
may be employed in the large scale production of the corresponding proteins by 
recombinant methods or for the generation of other useful polynucleotides such 

25 as antisense RNAs. Recombinant expression products of such HRR25-hkc DNAs 
may be employed for generation of antibodies and in screens for compounds 
which modulate the protein kinase and/or recombination/repair functions of these 
enzymes. Moreover, as suggested in the publication of Rowles, et aL , Robinson, 
et aL, and Wang, et aL , multiple ffly?25-like isozymes are expected to exist in 

30 a variety of eukaryotic species as both membrane bound and cytoplasmic proteins. 
It appears reasonable to expect that a number of genes and gene products exist in 
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human species, all of which are functionally related as well as structurally related 
to each other and to HRR25. 

The invention now being fully described, it will be apparent to one 
of ordinary skill in the art that various changes and modifications can be made 
5 without departing from the spirit or scope of the invention. 
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SUMMARY OF SEQUENCES 

SEQ ID NO: 1 is the nucleic acid sequence and the deduced amino 
acid of a genomic fragment encoding a yeast-derived protein kinase, HKR25 of 
the present invention. 

5 SEQ ID NO: 2 is the deduced amino acid sequence of a yeast- 

derived protein kinase HRR25 of the present invention. 

SEQ ID NO: 3 is the nucleic acid sequence (and the deduced amino 
acid sequence) of a genomic fragment encoding Hhpl+ of the present invention. 

SEQ ID NO: 4 is the deduced amino acid sequence of Hhpl-+- of 
10 the present invention. 

SEQ ID NO: 5 is the nucleic acid sequence (and the deduced amino 
acid sequence) of a genomic fragment encoding Hhp2+ of the present invention. 

SEQ ID NO: 6 is the deduced amino acid sequence of Hhp2-I- of 
the present invention. 

15 SEQ ID NO: 7 is the nucleic acid sequence (and the deduced amino 

acid sequence) of a genomic fragment encoding CKlalHu of the present 
invention. 

SEQ ID NO: 8 is the deduced amino acid sequence of CKlalHu 
of the present invention. 

20 SEQ ID NO: 9 is the nucleic acid sequence (and the deduced amino 

acid sequence) of a genomic fragment encoding CKla2Hu of the present 
invention. 
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SEQ ID NO: 10 is the deduced amino acid sequence of CKlQf2Hu 
of the present invention. 

SEQ ID NO: 11 is the nucleic acid sequence (and the deduced 
amino acid sequence) of a genomic fragment encoding CKla3Hu of the present 
5 invention. 

SEQ ID NO: 12 is the deduced amino acid sequence of CKlorSHu 
of the present invention. 

SEQ ID NO: 13 is the primer, 4583, representing top strand DNA 
encoding residues 16-23 of HRR25. 

10 SEQ ID NO: 14 is the primer, 4582, representing top strand DNA 

encoding residues 126-133 of HRK25, 

SEQ ID NO: 15 is the primer, 4589, representing bottom strand 
- DNA encoding residues 126-133 of HRR25. 

SEQ ID NO: 16 is the primer, 4590, representing bottom strand 
15 DNA encoding residues 194-199 of HRR25, 

SEQ ID NO: 17 is the primer JH21, representing bovine top strand 
DNA bases 47-67. 

SEQ ID NO: 18 is the primer JH22, representing bovine top strand 
DNA bases 223-240. 

20 SEQ ID NO: 19 is the primer JH29, representing bovine top strand 

DNA bases 604-623. 
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SEQ ID NO: 20 is the primer JH30, representing bovine bottom 
strand DNA bases 623-604. 

SEQ ID NO: 21 is the primer JH31, representing bovine bottom 
strand DNA bases 835-817. 

5 SEQ ID NO: 22 is the mutated HRR25 kinase domain primer found 

on p. 33, Example 3. 

SEQ ID NO: 23 is the nucleic acid sequence (and the deduced 
amino acid sequence) of a genomic fragment encoding NUFl of the present 
invention, 

10 SEQ ID NO: 24 is the deduced amino acid sequence of NUFl of 

the present invention. 

SEQ ID NOS: 25, 26 and 27 are the conserved motifs found on 

page 18. 

SEQ ID NOS: 28 and 29 are redundant oligonucleotides, based on 
15 conserved regions of HRR25-like proteins, used to amplify a probe from a human 
cDNA library. 



SEQ ID NO: 30 is the nucleotide sequence of the CKI7IHU gene. 



SEQ ID NO: 31 is the deduced amino acid sequence of the 
CKI7IHU protein. 

20 SEQ ID NO: 32 is the nucleotide sequence of the CKI72HU gene. 
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SEQ ID NO: 33 is the deduced amino acid sequence of the 
CKI72HU protein. 

SEQ ID NO: 34 is the nucleic acid sequence for CKISHu. 

SEQ ID NO: 35 is the deduced amino acid sequence for CKISHu. 

5 SEQ ID NO: 36 is the mutagenic oligonucleotide used to generate 

an Ncol restriction site in expression plasmid pRS305. 

SEQ ID NO: 37 is the mutagenic oligonucleotide used to generate 
an Ncol restriction site in CKI7I. 

SEQ ID NO: 38 is the mutagenic oligonucleotide used to create an 
10 iVcoI- restriction site in human CKIaa. 

SEQ ID NO: 39 is the mutagenic oligonucleotide used to introduce 
a BglH restriction site in CKI5. 

SEQ ID NO: 40 is the intervening nucleic acids sequence between 
the GALl promoter and initiating methionine codon in the CKI6 expression 
15 plasmid. 

SEQ ID NOS: 41 and 42 are amino terminal and internal peptide 
fragments of CKIor isoforms to generate monoclonal antibodies. 

SEQ ID NO: 43 is the primer used to create a Xbal restriction site 
in CKIaHu coding sequences. 

20 SEQ ID NO: 44 is the primer used to create a BamRl restriction 

site in the CKIalHu coding sequence. 
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SEQ ID NO: 45 is the primer used to create a BaniHl restriction 
site in the CKIa2Hu and CKIaSHu coding sequences. 

SEQ ID NO: 46 is the primer used to create a Xbal restriction site 
in the CKISHu coding sequence. 

5 SEQ ID NO: 47 is the primer used to create a BamHl restriction 

site in the CKISHu coding sequence. 

SEQ ID NO: 48 is the primer used to create a Xbal restriction site 
in the CKI7IHU and CKI 72HU coding sequences. 

SEQ ID NO: 49 is the primer used to create a BcunBl restriction 
10 site in the CKI7IHU and CKI 72Hu coding sequences. 

SEQ ID NO: 50 is an amino terminal peptide fragment of CKI7HU 
coupled to bovine gamma globulin and used to generate monoclonal antibodies in 
mice. 

SEQ ID NO: 51 is an amino terminal peptide fragment of CKI7HU 
15 coupled to bovine gamma globulin and used to generate monoclonal antibodies in 
mice. 

SEQ ID NO: 52 is an amino terminal peptide fragment of bovine 
CKIjS coupled to bovine gamma globulin and used to generate monoclonal 
antibodies in mice. 

20 SEQ ID NO: 53 is an amino terminal peptide fragment of bovine 

CK10 coupled to bovine gamma globulin and used to generate monoclonal 
antibodies in mice. 
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SEQ ID NO: 54 is an amino terminal peptide fragment of CKISHu 
coupled to bovine gamma globulin and used to generate monoclonal antibodies in 
mice. 

SEQ ID NO: 55 is a carboxy terminal peptide ftagment of CKISHu 
5 coupled to bovine gamma globulin and used to generate monoclonal antibodies in 
mice. 

SEQ ID NO: 56 is an carboxy terminal peptide fragment of 
CKIa2Hu and CKIaSHu coupled to bovine gamma globulin and used to generate 
monoclonal antibodies in mice. 

10 SEQ ID NO: 57 is an internal terminal peptide fragment common 

to all human CKI isoforms coupled to bovine gamma globulin and used to 
generate monoclonal antibodies in mice. 
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SEQUENCE LISTING 

(1) GENERAL INFORMATION: 

(i) APPLICANT: The Salk Institute For Biological Studies 
(ii) TITLE OF INVENTION: Protein Kinases 
(iii) NUMBER OF SEQUENCES: 57 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Meurshall, O' Toole, Ger stein, Murray & Borun 

(B) STREET: 233 South Wacker Drive, 6300 Sears Tower 

(C) CITY: Chicago 

(D) STATE: Illinois 

(E) COUNTRY: USA 

(F) ZIP: 60606-6402 

(V) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: PatentIn Release #1.0, Version #1.25 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 

(C) CLASSIFICATION: 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 08/008,001 

(B) FILING DATE: 21-JAN-1993 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 07/728,783 

(B) FILING DATE: 03-JUL-1991 

(viii) ATTORNEY /AGENT INFORMATION: 

(A) NAME: Noland, Greta E. 

(B) REGISTRATION NUMBER: 35,302 

(C) REFERENCE/DOCKET NUMBER: 27866/31853 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 312-474-6300 

(B) TELEFAX: 312-474-0448 

(C) TELEX: 25-3856 



(2) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3098 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D ) TOPOLOGY : linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(vii) IMMEDIATE SOURCE: 

(B) CLONE: Protein Kinase 
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( ix ) FEATURE : 

(A) NAME/KEY: CDS 

(B) LOCATION: 879.. 2360 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 

GTCGACTCGC CAATCACCAA GTTCTTATCC CACATCCGAC CAGTGTCTGA GTCATGGTTT 60 

ACCACCACCA TACCATCGCT GGTCATTTGT AAATCCGTTT CTATTACATC AGCACCTGCT 120 

GCATAAGCCT TCTCAAATGC TAGTAGCGTA TTTTCAGGAT ATCTTGCTTT AAAAGCTCTG 180 

TGGCCCACAA TTTCAACCAT CCTCGTGTCC TTGTTGTTAT CTTACACTTC TTATTTATCA 240 

ATAACACTAG TAACATCAAC AACACCAATT TTATATCTCC CTTAATTGTA TACTAAAAGA 300 

TCTAAACCAA TTCGGTATTG TCCTCGATAC GGCATGCGTA TAAAGAGATA TAATTAAAAG 360 

AGGTTATAGT CACGTGATGC AGATTACCCG CAACAGTACC ACAAAATGGA TACCATCTAA 420 

TTGCTATAAA AGGCTCCTAT ATACGAATAA CTACCACTGG ATCGACGATT* ATTTCGTGGC 480 

AATCATATAC CACTGTGAAG AGTTACTGCA ACTCTCGCTT TGTTTCAACG CTTCTTCCCG 540 

TCTGTGTATT TACTACTAAT AGGCAGCCCA CGTTTGAATT TCTTTTTTTC TGGAGAATTT 600 

TTGGTGCAAC GAGGAAAAGG AGACGAAGAA AAAAAGTTGA AACACGACCA CATATATGGA 660 

ACGTGGTTGA AATACAAAGA GAAGAAAGGT TCGACACTCG AGGAAAGCAT TTGGTGGTGA 720 

AAACACATCT TAGTAGCATC TTTAAACCTC TGTTGGGTAC TTAGAAAAAT ATTTCCAGAC 780 

TTCAAGGATA AAAAAAGTCG AAAAGTTACG ACATATTCGA CCAAAAAAAA AAACCAAAAA 840 

GAAAAGATAT ATTTATAGAA AGGATACATT AAAAAGAG ATG GAC TTA AGA GTA 893 

Met Asp Leu Arg Val 
1 5 

GGA AGG AAA TTT CGT ATT GGC AGG AAG ATT GGG AGT GGT TCC TTT GGT 941 
Gly Arg Lys Phe Arg lie Gly Arg Lys lie Gly Ser Gly Ser Phe Gly 
10 15 20 

GAC ATT TAC CAC GGC ACG AAC TTA ATT AGT GGT GAA GAA GTA GCC ATC 989 
Asp lie Tyr His Gly Thr Asn Leu lie Ser Gly Glu Glu Val Ala lie 
25 30 35 

AAG CTG GAA TCG ATC AGG TCC AGA CAT CCT CAA TTG GAC TAT GAG TCC 1037 
Lys Leu Glu Ser lie Arg Ser Arg His Pro Gin Leu Asp Tyr Glu Ser 
40 45 50 

CGC GTC TAC AGA TAC TTA AGC GGT GGT GTG GGA ATC CCG TTC ATC AGA 1085 
Arg Val Tyr Arg Tyr Leu Ser Gly Gly Val Gly lie Pro Phe lie Arg 
55 60 65 

TGG TTT GGC AGA GAG GGT GAA TAT AAT GCT ATG GTC ATC GAT CTT CTA 1133 
Trp Phe Gly Arg Glu Gly Glu Tyr Asn Ala Met Val lie Asp Leu Leu 
70 75 80 85 

GGC CCA TCT TTG GAA GAT TTA TTC AAC TAC TGT CAC AGA AGG TTC TCC 1181 
Gly Pro Ser Leu Glu Asp Leu Phe Asn Tyr Cys His Arg Arg Phe Ser 
90 95 100 

TTT AAG ACG GTT ATC ATG CTG GCT TTG CAA ATG TTT TGC CGT ATT CAG 1229 
Phe Lys Thr Val He Met Leu Ala Leu Gin Met Phe Cys Arg He Gin 
105 110 115 
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TAT ATA CAT GGA AGG TCG TTC ATT CAT AG A GAT ATC AAA CCA GAC AAC 12 7 7 

Tyr He His Gly Arg Ser Phe He His Arg Asp He Lys Pro Asp Aen 
120 125 130 

TTT TTA ATG GGG GTA GGA CGC CGT GGT AGC ACC GTT CAT GTT ATT GAT 1325 
Phe Leu Met Gly Val Gly Arg Arg Gly Ser Thr Val His Val lie Asp 
135 140 145 

TTC GGT CTA TCA AAG AAA TAC CGA GAT TTC AAC ACA CAT CGT CAT ATT 1373 
Phe Gly Leu ser Lys Lys Tyr Arg Asp Phe Asn Thr His Arg His He 
150 155 160 165 

CCT TAC AGG GAG AAC AAG TCC TTG ACA GGT ACA GCT CGT TAT GCA AGT 1421 
Pro Tyr Arg Glu Asn Lys Ser Leu Thr Gly Thr Ala Arg Tyr Ala Ser 
170 175 180 

GTC AAT ACG CAT CTT GGA ATA GAG CAA AGT AG A AGA GAT GAC TTA GAA 1469 
Val Asn Thr His Leu Gly He Glu Gin Ser Arg Arg Asp Asp Leu Glu 
185 190 195 

TCA CTA GGT TAT GTC TTG ATC TAT TTT TGT AAG GGT TCT TTG CCA TGG 1517 
Ser Leu Gly Tyr Val Leu He Tyr Phe Cys Lys Gly Ser Leu Pro Trp 
200 205 210 

CAG GGT TTG AAA GCA ACC ACC AAG AAA CAA AAG TAT GAT CGT ATC ATG 1565 
Gin Gly Leu Lys Ala Thr Thr Lys Lys Gin Lys Tyr Asp Arg He Met 
215 220 225 

GAA AAG AAA TTA AAC GTT AGC GTG GAA ACT CTA TGT TCA GGT TTA CCA 1613 
Glu Lys .Lys Leu Asn Val Ser Val Glu Thr Leu Cys Ser Gly Leu Pro 
230 235 240 245 

TTA GAG TTT CAA GAA TAT ATG GCT TAC TGT AAG AAT TTG AAA TTC GAT 1661 
Leu Glu Phe Gin Glu Tyr Met Ala Tyr Cys Lys Asn Leu Lys Phe Asp 
250 255 260 

GAG AAG CCA GAT TAT TTG TTC TTG GCA AGG CTG TTT AAA GAT CTG AGT 1709 
Glu Lys Pro Asp Tyr Leu Phe Leu Ala Arg Leu Phe Lys Asp Leu Ser 
265 270 275 

ATT AAA CTA GAG TAT CAC AAC GAC CAC TTG TTC GAT TGG ACA ATG TTG 1757 
He Lys Leu Glu Tyr His Asn Asp His Leu Phe Asp Trp Thr Met Leu 
280 285 290 

CGT TAC ACA AAG GCG ATG GTG GAG AAG CAA AGG GAC CTC CTC ATC GAA 1805 
Arg Tyr Thr Lys Ala Met Val Glu Lys Gin Arg Asp Leu Leu He Glu 
295 300 305 

AAA GGT GAT TTG AAC GCA AAT AGC AAT GCA GCA AGT GCA AGT AAC AGC 1853 
Lys Gly Asp Leu Asn Ala Asn Ser Asn Ala Ala Ser Ala Ser Asn Ser 
310 315 320 325 

ACA GAC AAC AAG TCT GAA ACT TTC AAC AAG ATT AAA CTG TTA GCC ATG 1901 
Thr Asp Asn Lys Ser Glu Thr Phe Asn Lys He Lys Leu Leu Ala Met 
330 335 340 

AAG AAA TTC CCC ACC CAT TTC CAC TAT TAC AAG AAT GAA GAC AAA CAT 1949 
Lys Lys Phe Pro Thr His Phe His Tyr Tyr Lys Asn Glu Asp Lys His 
345 350 355 

AAT CCT TCA CCA GAA GAG ATC AAA CAA CAA' ACT ATC TTG TiAT AAT AAT 1997 
Asn Pro Ser Pro Glu Glu He Lys Gin Gin Thr He Leu Asn Asn Asn 
360 365 370 

GCA GCC TCT TCT TTA CCA GAG GAA TTA TTG AAC GCA CTA GAT AAA GGT 2045 
Ala Ala Ser Ser Leu Pro Glu Glu Leu Leu Asn Ala Leu Asp Lys Gly 
375 380 385 
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ATG GAA AAC TTG AGA CAA CAG CAG CCG CAG CAG CAG GTC CAA AGT TCG 2093 
Met Glu Asn Leu Arg Gin Gin Gin Pro Gin Gin Gin Val Gin Ser Ser 
390 395 400 405 

CAG CCA CAA CCA CAG CCC CAA CAG CTA CAG CAG CAA CCA AAT GGC CAA 2141 
Gin Pro Gin Pro Gin Pro Gin Gin Leu Gin Gin Gin Pro Asn Gly Gin 
410 415 420 

AGA CCA AAT TAT TAT CCT GAA CCG TTA CTA CAG CAG CAA CAA AGA GAT 2189 
Arg Pro Asn Tyr Tyr Pro Glu Pro Leu Leu Gin Gin Gin Gin Arg Asp 
425 430 435 

TCT CAG GAG CAA CAG CAG CAA GTT CCG ATG GCT ACA ACC AGG GCT ACT 2237 
Ser Gin Glu Gin Gin Gin Gin Val Pro Met Ala Thr Thr Arg Ala Thr 
440 445 450 

CAG TAT CCC CCA CAA ATA AAC AGC AAT AAT TTT AAT ACT AAT CAA GCA 2285 
Gin Tyr Pro Pro Gin lie Asn Ser Asn Asn Phe Asn Thr Asn Gin Ala 
455 460 465 

TCT GTA CCT CCA CAA ATG AGA TCT AAT CCA CAA CAG CCG CCT CAA GAT 2333 
Ser Val Pro Pro Gin Met Arg Ser Asn Pro Gin Gin Pro Pro Gin Asp 
470 475 480 485 

AAA CCA GCT GGC CAG TCA ATT TGG TTG TAAGCAACAT ATATTGCTCA 2380 
Lys Pro Ala Gly Gin Ser lie Trp Leu 
490 



AAACGCACAA 


AAATAAACAT 


ATGTATATAT 


AGACATACAC 


ACACACATAT 


ATATATATAT 


2440 


ATTATTATTA 


TTATTTACAT 


ATACGTACAC 


ACAATTCCAT 


ATCGAGTTAA 


TATATACAAT 


2500 


TCTGGCCTTC 


TTACCTAAAA 


AGATGATAGC 


TAAAAGAACC 


ACTTTTTTTA 


TGCATTTTTT 


2560 


TCTTCGGGAA 


GGAAATTAAG 


GGGGAGCGGA 


GCACCTCTTG 


GCCAATTTGT 


TTTTTTTTTA 


2620 


TGTAATAAAG 


GGCTAACGAT 


CGAAGATCAA 


TCACGAATAT 


TGGACGGTTT 


TAAAGGAGGG 


2680 


CCTCTGAGAA 


GACAGCATCA 


ATTCGTATTT 


TCGATAATTA 


ACTTCCCTTA 


TAGTGTCTGA 


2740 


TTAGGAAACA 


ATCACGAGAC 


GATAACGACG 


GAATACCAAG 


GAAGTTTGTG 


CAAATATACA 


2800 


GCCGGCACAA 


ACAGCAGCTT 


CACTCAGGTT 


AACTCACATA 


CTGTTGAAAA 


TTGTCGGTAT 


2860 


GGAATTCGTT 


GCAGAAAGGG 


CTCAGCCAGT 


TGGTCAAACA 


ATCCAGCAGC 


AAAATGTTAA 


2920 


TACTTACGGG 


CAAGGCGTCC 


TACAACCGCA 


TCATGATTTA 


CAGCAGCGAC 


AACAACAACA 


2980 


ACAGCAGCGT 


CAGCATCAAC 


AACTGCTGAC 


GTCTCAGTTG 


CCCCAGAAAT 


CTCTCGTATC 


3040 


CAAAGGCAAA 


TATACACTAC 


ATGACTTCCA 


GATTATGAGA 


ACGCTTGGTA 


CTGGATCC 


3098 



(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS; 

(A) LENGTH: 494 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECtJLE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Met Asp Leu Arg Val Gly Arg Lys Phe Arg lie Gly Arg Lys lie Gly 
15 10 15 
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Ser Gly Ser Phe Gly Asp lie Tyr His Gly Thr Asn Leu lie Ser Gly 
20 25 30 

Glu Glu Val Ala lie Lys Leu Glu Ser lie Arg Ser Arg His Pro Gin 
35 40 45 

Leu Asp Tyr Glu Ser Arg Val Tyr Arg Tyr Leu Ser Gly Gly Val Gly 
50 55 60 

lie Pro Phe lie Arg Trp Phe Gly Arg Glu Gly Glu Tyr Asn Ala Met 
65 70 75 80 

Val lie Asp Leu Leu Gly Pro Ser Leu Glu Asp Leu Phe Asn Tyr Cye 
85 90 95 

His Arg Arg Phe Ser Phe Lys Thr Val He Met Leu Ala Leu Gin Met 
100 105 110 

Phe Cys Arg He Gin Tyr He His Gly Arg Ser Phe He His Arg Asp 
115 120 125 

He Lys Pro Asp Asn Phe Leu Met Gly Val Gly Arg Arg Gly Ser Thr 
130 135 140 

Val His Val He Asp Phe Gly Leu Ser Lys Lys Tyr Arg Asp Phe Asn 
145 150 155 160 

Thr His Arg His He Pro Tyr Arg Glu Asn Lys Ser Leu Thr Gly Thr 
165 170 175 

Ala Arg Tyr Ala Ser Val Asn Thr His Leu Gly He Glu Gin Ser Arg 
180 185 190 

Ar-g Asp Asp Leu Glu Ser Leu Gly Tyr Val Leu He Tyr Phe Cys Lys 
195 200 205 

Gly Ser Leu Pro Trp Gin Gly Leu Lys Ala Thr Thr Lys Lys Gin Lys 
210 215 220 

Tyr Asp Arg He Met Glu Lys Lys Leu Asn Val Ser Val Glu Thr Leu 
225 230 235 240 

Cys Ser Gly Leu Pro Leu Glu Phe Gin Glu Tyr Met Ala Tyr Cys Lys 
245 250 255 

Asn Leu Lys Phe Asp Glu Lys Pro Asp Tyr Leu Phe Leu Ala Arg Leu 
260 265 270 

Phe Lys Asp Leu Ser He Lys Leu Glu Tyr His Asn Asp His Leu Phe 
275 280 285 

Asp Trp Thr Met Leu Arg Tyr Thr Lys Ala Met Val Glu Lys Gin Arg 
290 295 300 

Asp Leu Leu He Glu Lys Gly Asp Leu Asn Ala Asn Ser Asn Ala Ala 
305 310 315 320 

Ser Ala Ser Asn Ser Thr Asp Asn Lys Ser Glu Thr Phe Aen Lys He 
325 330 335 

Lys Leu Leu Ala Met Lys Lys Phe Pro Thr His Phe His Tyr Tyr Lys 
340 345 350 

Asn Glu Asp Lys His Asn Pro Ser Pro Glu Glu He Lys Gin Gin Thr 
355 360 365 
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Ile Leu Asn Asn Asn Ala Ala Ser Ser Leu Pro Glu Glu Leu Leu Asn 
370 375 380 

Ala Leu Asp Lys Gly Met Glu Asn Leu Arg Gin Gin Gin Pro Gin Gin 
385 390 395 400 

Gin Val Gin Ser Ser Gin Pro Gin Pro Gin Pro Gin Gin Leu Gin Gin 
405 410 415 

Gin Pro Asn Gly Gin Arg Pro Asn Tyr Tyr Pro Glu Pro Leu Leu Gin 
420 425 430 

Gin Gin Gin Arg Asp Ser Gin Glu Gin Gin Gin Gin Val Pro Met Ala 
435 440 445 

Thr Thr Arg Ala Thr Gin Tyr Pro Pro Gin lie Asn Ser Asn Asn Phe 
450 455 460 

Asn Thr Asn Gin Ala Ser Val Pro Pro Gin Met Arg Ser Asn Pro Gin 
465 470 475 480 

Gin Pro Pro Gin Asp Lys Pro Ala Gly Gin Ser lie Trp Leu 
485 490 



(2) INFORMATION FOR SEQ ID NOs3: 

(i) SEQUENCE CHARACTERISTICS; 

(A) LENGTH: 2469 base pairs 
{B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: DNA (genomic) 

<vii) IMMEDIATE SOURCE: 

(B) CLONE: Protein Kinase 

< ix ) FEATURE : 

(A) NAME/KEY: CDS 

(B) LOCATION: 113.. 1207 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

AATATTTCAA GCTATACCAA GCATACAATC AACTCCAAGC TTCGAGCGGC CGCCAGTGTG 60 

CTCTAAAGGA AAAAGCGAGT GCCTTTAGCC TTAAAAGCGT TATAATATTA TT ATG 115 

Met 
1 

GCT TTG GAC CTC CGG ATT GGG AAC AAG TAT CGC ATT GGT CGT AAA ATT 163 
Ala Leu Asp Leu Arg lie Gly Asn Lys Tyr Arg lie Gly Arg Lys lie 
5 10 15 

GGC AGT GGA TCT TTC GGA GAC ATT TAT CTT GGG ACT AAT GTC GTT TCT 211 
Gly Ser Gly Ser Phe Gly Asp lie Tyr Leu Gly Thr Asn Val Val Ser 
20 25 30 

GGT GAA GAG GTC GCT ATC AAG CTA GAA TCA ACT CGT GCT AAA CAC CCT 259 
Gly Glu Glu Val Ala lie Lys Leu Glu Ser Thr Arg Ala Lys His Pro 
35 40 45 

CAA TTG GAG TAT GAA TAG AGA GTT TAT CGC ATT TTG TCA GGA GGG GTC 307 
Gin Leu Glu Tyr Glu Tyr Arg Val Tyr Arg lie Leu Ser Gly Gly Val 
50 55 60 65 
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GGA ATC CCG TTT GTT CGT TGG TTC GGT GTA GAA TGT GAT TAG AAC GCT 355 

Gly lie Pro Phe Val Arg Trp Phe Gly Val Glu Cys Asp Tyr Aan Ala 
70 75 80 

ATG GTG ATG GAT TTA TTG GGT CCT TCG TTG GAA GAC TTG TTT AAT TTT 403 
Met Val Met Asp Leu Leu Gly Pro Ser Leu Glu Asp Leu Phe Asn Phe 
85 90 95 

TGC AAT CGA AAG TTT TCT TTG AAA ACA GTT CTT CTC CTT GCG GAC CAG 451 
Cys Asn Arg Lys Phe Ser Leu Lye Thr Val Leu Leu Leu Ala Asp Gin 
100 105 110 

CTC ATT TCT CGA ATT GAA TTC ATT CAT TCA AAA TCT TTT CTT CAT CGT 499 
Leu lie Ser Arg lie Glu Phe lie His Ser Lys Ser Phe Leu His Arg 
115 120 125 

GAT ATT AAG CCT GAT AAC TTT TTA ATG GGA ATA GGT AAA AGA GGA AAT 547 
Asp lie Lys Pro Asp Asn Phe Leu Met Gly lie Gly Lys Arg Gly Asn 
130 135 140 145 

CAA GTT AAC ATA ATT GAT TTC GGA TTG GCT AAG AAG TAT CGT GAT CAC 595 
Gin Val Asn lie lie Asp Phe Gly Leu Ala Lys Lys Tyr Arg Asp His 
150 155 160 

AAA ACT CAC CTG CAC ATT CCT TAT CGC GAG AAC AAG AAT CTT ACA GGT 643 
Lys Thr His Leu His lie Pro Tyr Arg Glu Asn Lys Asn Leu Thr Gly 
165 170 175 

ACT GCA CGC TAT GCT AGC ATC AAT ACT CAT TTA GGT ATT GAA CAA TCC 691 
Thr Ala Arg Tyr Ala Ser lie Asn Thr His Leu Gly lie Glu Gin Ser 
180 185 190 

CGC CGT GAT GAC CTC GAA TCT TTA GGT TAT GTG CTC GTC TAC TTT TGT 739 
Arg Arg Asp Asp Leu Glu Ser Leu Gly Tyr Val Leu Val Tyr Phe Cys 
195 200 205 

CGT GGT AGC CTG CCT TGG CAG GGA TTG AAG GCT ACC ACG AAA AAG CAA 787 
Arg Gly Ser Leu Pro Trp Gin Gly Leu Lys Ala Thr Thr Lys Lys Gin 
210 215 220 225 

AAG TAT GAA AAG ATT ATG GAG AAG AAG ATC TCT ACG CCT ACA GAG GTC 835 
Lys Tyr Glu Lys lie Met Glu Lys Lys lie Ser Thr Pro Thr Glu Val 
230 235 240 

TTA TGT CGG GGA TTC CCT CAG GAG TTC TCA ATT TAT CTC AAT TAC ACG 883 
Leu Cys Arg Gly Phe Pro Gin Glu Phe Ser lie Tyr Leu Asn Tyr Thr 
245 250 255 

AGA TCT TTA CGT TTC GAT GAC AAA CCT GAT TAC GCC TAC CTT CGC AAG 931 
Arg Ser Leu Arg Phe Asp Asp Lys Pro Asp Tyr Ala Tyr Leu Arg Lys 
260 265 270 

CTT TTC CGA GAT CTT TTT TGT CGG CAA TCT TAT GAG TTT GAC TAT ATG 979 
Leu Phe Arg Asp Leu Phe Cys Arg Gin Ser Tyr Glu Phe Asp Tyr Met 
275 280 285 

TTT GAT TGG ACC TTG AAG AGA AAG ACT CAA CAA GAC CAA CAA CAT CAG 1027 
Phe Asp Trp Thr Leu Lys Arg Lys Thr Gin Gin Asp Gin Gin His Gin 
290 295 300 305 

CAG CAA TTA CAG CAA CAA CTG TCT GCA ACT CCT CAA GCT ATT AAT CCG 1075 
Gin Gin Leu Gin Gin Gin Leu Ser Ala Thr Pro Gin Ala lie Asn Pro 
310 315 320 

CCG CCA GAG AGG TCT TCA TTT AGA AAT TAT CAA AAA CAA AAC TTT GAT 1123 
Pro Pro Glu Arg Ser Ser Phe Arg Asn Tyr Gin Lys Gin Asn Phe Asp 
325 330 335 
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GAA AAA GGC GGA GAC ATT AAT ACA ACC GTT CCT GTT ATA AAT GAT CCA 1171 
Glu Lys Gly Gly Asp He Asn Thr Thr Val Pro Val He Asn Asp Pro 
340 345 350 



TCT GCA ACC GGA GCT CAA TAT ATC AAC AGA CCT AAT TGATTAGCCT 1217 
Ser Ala Thr Gly Ala Gin Tyr He Asn Arg Pro Asn 



^ c c 




J ou 




JO 3 








TIk. TT Zi T H T 2Vr^ 




m m ^ ni m m m m ^ rn 

X XAX XXI. XAi. 


AXlXXwXXwX 


^AX ^ X wOAV? 1 


x^ / / 




X L 1. 1. X XA 




\a X\^\^ X X X AAl 


X X A w X X w A X A 


uwVrVir AvvVjtW X 


XwO / 


TTTTCCTTGG 


GATGGCGAAA 


GTTAC X 1 X GC 


xxAxAv7X xlA 


X X V7 A&Gv^ T T C 


AT AG CTT AT x 


X J7 / 


TGG CTG AAG A 


TCTTGTGTTG 


ACTxAAAxTC 


X A 1 GCZTAACC 


1*0 ■R T*/^ * *pr^7V *T* 

X C A X GAT CAT 


jv *p/^/^^r^iv •P'PIl 

ATCCTCATTA 


X43 / 


TGGCAAGTTT 


TGGTGAAAAA 


IIIWI* III HI 1*111 IV 

TTTTTTAATA 


TT AG TAG ATT 


TG CT AAT AAT 


ACATTTGGTA 


XbX / 


TTTGTTTTTA 


CTACCTGTGA 


ATCTATTCAT 


ACATTATCAT 


AT ATGTT T CG 


AG C C AGG AAC 


T e T "7 

157 / 


AGAAAAAAGT 


GAGAGAATTT 


TCTGCAGAAA 


TGATCATAAT 


TTTATCTTCG 


CTTAACACGA 


1637 


ATCCTGGTGA 


CAGATTATCG 


TGGTTTAAAG 


CCTTTTTTTT 


ACGACGCCAT 


AAGCAAATTG 


1697 


GTTACTTTTT 


TATGTGTGAT 


GAGCCTTGGG 


GTTTAATCTA 


ATTAGAAGGC 


ATTGCATTCA 


1757 


TATACTTTTA 


ATAATATATT 


ATCAGCTATT 


TGCTGCTTTT 


CTTTATAGAT 


ACCGTCTTTT 


1817 


CCAAGCTGAA 


CTCATTTAAT 


CAGCGTCGTT 


TAACCTTAGG 


ATGCTTAAGA 


TGCGTTTAAA 


1877 


TTCAATGACT 


TAATG CTCG A 


GGGATGAATG 


GTTTGTTTTA 


GTTCGTGTTC 


TGGGTGCATG 


1937 


ATCTCGTGCT 


TGACTGTTTT 


ATTGAAGCGT 


TCATTTCATG 


AAGTGTCTTT 


CGATGTTGTT 


1997 


CACACTTCTG 


TTTGCTAAAT 


ATAATAAATA 


TTTTGCTTTT 


CACTTTAGAG 


CACACTGGCG 


2057 


GCCGCTCGAA 


GCTTTGGACT 


TCTTCGCCAT 


TGGTCAAGTC 


TCCAATCAAG 


GTTGTCGGCT 


2117 


TGTCTACCTT 


GCCAGAAATT 


TACGAAAAGA 


TGGAAAAGGG 


ATCCAAATCG 


TTGGTAGATA 


2177 


CTTGTTGACA 


CTTCTAAATA 


AGCGAATTTC 


TTATGATTTA 


TGATTTTTAT 


TATTAAATAA 


2237 


GTTATAAAAA 


AAATAAGGTA 


TACAAATTTT 


AAAGTGACTC 


TTAGGTTTTA 


AAACGAAAAT 


2297 


TCTTATTCTT 


GAGTAACTCT 


TTCCTGTAGG 


TCAGGTTGCT 


TTCTCAGGTA 


TAGCATGAGG 


2357 


TCGCTCTTAT 


TGACCACACC 


TCTACCGGCA 


TGCCGAGCAA 


ATGCCTGCAA 


ATCGCTCCCC 


2417 


ATTTCACCCA 


ATTGTAGATA 


TGCTAACTCC 


AGCAATGAGC 


CGATGAATCT 


CC 


2469 



(2) INFORMATION FOR SEQ ID N05 4t 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 365 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(11) MOLECULE TYPE: protein 

<xl) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

Met Ala Leu Asp Leu Arg He Gly Asn Lys Tyr Arg He Gly Arg Lys 
15 10 15 

He Gly Ser Gly Ser Phe Gly Asp He Tyr Leu Gly Thr Asn Val Val 
20 25 30 
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Ser Gly Glu Glu Val Ala lie Lys Leu Glu Ser Thr Arg Ala Lys His 
35 40 45 

Pro Gin Leu Glu Tyr Glu Tyr Arg Val Tyr Arg lie Leu Ser Gly Gly 
50 55 60 

Val Gly lie Pro Phe Val Arg Trp Phe Gly Val Glu Cys Asp Tyr Asn 
65 70 75 80 

Ala Met Val Met Asp Leu Leu Gly Pro Ser Leu Glu Asp Leu Phe Asn 
85 90 95 

Phe Cys Aen Arg Lys Phe Ser Leu Lys Thr Val Leu Leu Leu Ala Asp 
100 105 110 

Gin Leu lie Ser Arg lie Glu Phe lie His Ser Lys Ser Phe Leu His 
115 120 125 

Arg Asp lie Lys Pro Asp Asn Phe Leu Met Gly lie Gly Lys Arg Glv 
130 135 140 

Aen Gin Val Asn lie lie Asp Phe Gly Leu Ala Lys Lys Tyr Arg Asp 
145 150 155 160 

His Lys Thr His Leu His lie Pro Tyr Arg Glu Asn Lys Asn Leu Thr 
165 170 175 

Gly Thr Ala Arg Tyr Ala Ser lie Asn Thr His Leu Gly He Glu Gin 
180 185 190 

Ser Arg Arg Asp Asp Leu Glu Ser Leu Gly Tyr Val Leu Val Tyr Phe 
195 200 205 

Cys Arg Gly Ser Leu Pro Trp Gin Gly Leu Lys Ala Thr Thr Lys Lys 
210 215 220 

Gin Lys Tyr Glu Lys He Met Glu Lys Lys He Ser Thr Pro Thr Glu 
225 230 235 240 

Val Leu Cys Arg Gly Phe Pro Gin Glu Phe Ser He Tyr Leu Asn Tyr 
245 250 255 

Thr Arg Ser Leu Arg Phe Asp Asp Lys Pro Asp Tyr Ala Tyr Leu Arg 
260 265 270 

Lys Leu Phe Arg Asp Leu Phe Cys Arg Gin Ser Tyr Glu Phe Asp Tyr 
275 280 285 

Met Phe Asp Trp Thr Leu Lys Arg Lys Thr Gin Gin Asp Gin Gin His 
290 295 300 

Gin Gin Gin Leu Gin Gin Gin Leu Ser Ala Thr Pro Gin Ala He Asn 
305 310 315 320 

Pro Pro Pro Glu Arg Ser Ser Phe Arg Asn Tyr Gin Lys Gin Asn Phe 
325 330 335 

Asp Glu Lys Gly Gly Asp He Asn Thr Thr Val Pro Val He Asn Asp 
340 345 350 

Pro Ser Ala Thr Gly Ala Gin Tyr He Asn Arg Pro Asn 
355 360 365 
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(2) INFORMATION FOR SEQ ID NO: 5 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1989 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(vii) IMMEDIATE SOURCE: 

(B) CLONE: Protein Kinase 

(ix) FEATURE: 

(A) NAME /KEY: CDS 

(B) LOCATION: SO.. 1249 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

CCGCCAGTGT GCTCTAAAGG TCATCTCTGT GAATTAGAAT CTTAGCAAA ATG ACG 55 

Met Thr 
1 

GTT GTT GAC ATT AAG ATT GGT AAT AAA TAT CGT ATA GGT AGA AAA ATT 103 
Val Val Asp lie Lys lie Gly Asn Lys Tyr Arg lie Gly Arg Lys lie 
5 10 15 

GGT TCT GGC TCC TTT GGT CAA ATT TAC CTG GGA TTA AAT ACG GTA AAT 151 
Gly Ser Gly Ser Phe Gly Gin lie Tyr Leu Gly Leu Asn Thr Val Asn 
20 25 30 

GGA GAA CAA GTT GOT GTG AAA TTG GAG CCT TTA AAG GCT CGT CAT CAT 199 
Gly Glu Gin Val Ala Val Lys Leu Glu Pro Leu Lys Ala Arg His His 
35 40 45 50 

CAG TTA GAA TAT GAG TTT CGT GTG TAT AAT ATT CTT AAA GGA AAT ATT 247 
Gin Leu Glu Tyr Glu Phe Arg Val Tyr Asn lie Leu Lye Gly Asn lie 
55 60 65 

GGC ATA CCC ACA ATT CGC TGG TTC GGT GTA ACC AAT AGT TAT AAT GCT 295 
Gly lie Pro Thr lie Arg Trp Phe Gly Val Thr Asn Ser Tyr Asn Ala 
70 75 80 

ATG GTC ATG GAT TTA TTA GGC CCT TCT CTG GAA GAT TTA TTC TGC TAT 343 
Met Val Met Asp Leu Leu Gly Pro Ser Leu Glu Asp Leu Phe Cys Tyr 
85 90 95 

TGT GGA AGA AAG TTT ACT CTT AAA ACG GTT CTT TTA CTT GCT GAT CAA 391 
Cys Gly Arg Lys Phe Thr Leu Lys Thr Val Leu Leu Leu Ala Asp Gin 
100 105 110 

CTC ATC AGT CGC ATT GAA TAT GTT CAC TCC AAG TCA TTC TTA CAT CGA 439 
Leu lie Ser Arg lie Glu Tyr Val His Ser Lys Ser Phe Leu His Arg 
115 120 125 130 

GAC ATT AAG CCT GAT AAT TTT TTA ATG AAG AAG CAC AGO AAT GTT GTT 487 
Asp lie Lys Pro Asp Asn Phe Leu Met Lys Lys His Ser Asn Val Val 
135 140 145 

ACG ATG ATT GAC TTC GGA TTG GCG AAA AAA TAC AGG GAT TTT AAA ACT 535 
Thr Met lie Asp Phe Gly Leu Ala Lys Lys Tyr Arg Asp Phe Lys Thr 
150 155 160 
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CAT GTT CAT ATT CCA TAT CGA GAT AAT AAG AAT CTT ACG GGA ACG GCT 583 

His Val His lie Pro Tyr Arg Asp Asn Lys Asn Leu Thr Gly Thr Ala 
165 170 175 

CGA TAT GCT AGT ATT AAC ACC CAT ATT GGT ATT GAA CAA TCT CGC CGT 631 
Arg Tyr Ala Ser lie Asn Thr His lie Gly lie Glu Gin Ser Arg Arg 
180 185 190 

GAT GAC CTC GAA TCG TTA GGT TAT GTT TTA CTT TAT TTT TGT CGC GGC 679 
Asp Asp Leu Glu Ser Leu Gly Tyr Val Leu Leu Tyr Phe Cys Arg Gly 
195 200 205 210 

AGT TTG CCC TGG CAA GGC TTA CAA GCT GAT ACA AAG GAG CAA AAG TAT 727 
Ser Leu Pro Trp Gin Gly Leu Gin Ala Asp Thr Lys Glu Gin Lys Tyr 
215 220 225 

CAA CGG ATA CGT GAT ACC AAG ATT GGC ACT CCT TTG GAA GTC CTT TGC 775 
Gin Arg lie Arg Asp Thr Lys lie Gly Thr Pro Leu Glu Val Leu Cys 
230 235 240 

AAA GGT CTT CCC GAA GAG TTT ATC ACT TAC ATG TGT TAC ACT CGT CAG 823 
Lys Gly Leu Pro Glu Glu Phe lie Thr Tyr Met Cys Tyr Thr Arg Gin 
245 250 255 

CTT TCG TTT ACC GAG AAG CCA AAC TAT GCT TAT TTG AGA AAG CTG TTT 871 
Leu Ser Phe Thr Glu Lys Pro Asn Tyr Ala Tyr Leu Arg Lys Leu Phe 
260 265 270 

CGT GAT TTA CTT ATT CGT AAA GGA TAC CAG TAT GAC TAT GTT TTT GAC 919 
Arg Asp Leu Leu lie Arg Lys Gly Tyr Gin Tyr Asp Tyr Val Phe Asp 
275 280 285 290 

TGG ATG ATA TTA AAA TAC CAA AAG CGA GCT GCT GCT GCT GCC GCC GCT 967 
Trp Met He Leu Lys Tyr Gin Lys Arg Ala Ala Ala Ala Ala Ala Ala 
295 300 305 

TCT GCT ACA GCA CCT CCA CAG GTT ACA TCT CCT ATG GTG TCA CAA ACT 1015 
Ser Ala Thr Ala Pro Pro Gin Val Thr Ser Pro Met Val Ser Gin Thr 
'310 315 320 

CAA CCG GTT AAT CCC ATT ACT CCT AAT TAT TCA TCC ATT CCC TTA CCT 1063 
Gin Pro Val Asn Pro He Thr Pro Asn Tyr Ser Ser He Pro Leu Pro 
325 330 335 

GCT GAG CGG AAT CCA AAG ACT CCA CAA TCT TTC TCC ACT AAT ATT GTT 1111 
Ala Glu Arg Asn Pro Lys Thr Pro Gin Ser Phe Ser Thr Asn He Val 
340 345 350 

CAA TGT GCT TCT CCC TCA CCT CTT CCT CTC TCC TTT CGT TCT CCT GTT 1159 
Gin Cys Ala Ser Pro Ser Pro Leu Pro Leu Ser Phe Arg Ser Pro Val 
355 360 365 370 

CCC AAC AAA GAT TAT GAA TAC ATT CCA TCT TCG TTG CAA CCT CAA TAC 1207 
Pro Asn Lys Asp Tyr Glu Tyr He Pro Ser Ser Leu Gin Pro Gin Tyr 
375 380 385 

AGT GCT CAA CTG AGG CGT GTT TTA GAT GAA GAA CCA GCT CCT 1249 
Ser Ala Gin Leu Arg Arg Val Leu Asp Glu Glu Pro Ala Pro 
390 395 400 



TGATTTTTTG 


ACTTTACTTT 


TCATCAATTC 


CTCTCTTACA 


CTACGTCTTT 


TAGTCTTAAA 


1309 


TTCCAAACCA 


TCTGTTGACG 


TTTTAAAGTT 


CCACAAATAT 


CTTTAATAAT 


TCCTGGCTTT 


1369 


CTTTTTTGTC 


TATGGATGGC 


CGGATTGCTA 


CACTAATACA 


CTTTGAGGTT 


TAGCTATTGT 


1429 


TTTGAGCTAT 


TCCATTTTGC 


CTAGAAGTTG 


AGTTTTAATG 


CCTTCTTTTT 


AAATAGACAT 


1489 
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ATTGTGTAAA CCTCATACAT GCTTTACTGA AAAGACATAA TTAGAGGACA AAATTTAAAT 1549 

CGTGCTGTTT GTTTATATTC AGCTCGTTCC GGTCAAGTTC TTGCCAAAGA ATTGAGTCAG 1609 

TCGTGCTATT CATTTCTAAA TTTCTTCTTC CCAGAATTTT ATTTTATTGT TTTCGTTCCC 1669 

CATTGGTTCT TACATTCCGT TTTTATTCAA AACTGAAAAG TTTGTACCTC CATTGCTAGA 172 9 

AGTAATATAC ACAAGGAGCA TGTTTCTTTT TTTACACTAT CATTTGCGTG GCTCTAAACC 1789 

AGTCTTTATT GCCTACCTTT GCAATAAAAG ATATAATATC AATTGCATAA GAAATAATTC 1849 

ATTAATAAAT GATAAATTTC ATCGATTAAA TAAAAAAAAA AAACTTTAGA GCTTTAGAGC 1909 

ACAACTGGCG GCCGCTCGAA GCTTTGGACT TCTTCGCCAT TGGTCAAGTC TCAATCAAGG 1969 

TTGTCGGCTT GTCTACCTTC 1989 

(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 400 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:6: 

Met Thr Val Val Asp lie Lys lie Gly Asn Lys Tyr Arg He Gly Arg 
15 10 15 

Lys He Gly Ser Gly Ser Phe Gly Gin He Tyr Leu Gly Leu Asn Thr 
20 25 30 

Val Asn Gly Glu Gin Val Ala Val Lys Leu Glu Pro Leu Lys Ala Arg 
35 40 45 

Hie His Gin Leu Glu Tyr Glu Phe Arg Val Tyr Asn He Leu Lys Gly 
50 55 60 

Asn He Gly He Pro Thr He Arg Trp Phe Gly Val Thr Asn Ser Tyr 
65 70 75 80 

Asn Ala Met Val Met Asp Leu Leu Gly Pro Ser Leu Glu Asp Leu Phe 
85 90 95 

Cys Tyr Cys Gly Arg Lys Phe Thr Leu Lys Thr Val Leu Leu Leu Ala 
100 105 110 

Asp Gin Leu He Ser Arg He Glu Tyr Val His Ser Lys Ser Phe Leu 
115 120 125 

His Arg Asp He Lys Pro Asp Asn Phe Leu Met Lys Lys His Ser Asn 
130 135 140 

Val Val Thr Met He Asp Phe Gly Leu Ala Lys Lys Tyr Arg Asp Phe 
145 150 155 160 

Lys Thr His Val His He Pro Tyr Arg Asp Asn Lys Asn Leu Thr Gly 
165 170 175 

Thr Ala Arg Tyr Ala Ser He Asn Thr His He Gly He Glu Gin Ser 
180 185 190 
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Arg Arg Asp Asp Leu Giu Ser Leu Gly Tyr Val Leu Leu Tyr Phe Cys 
195 200 205 

Arg Gly Ser Leu Pro Trp Gin Gly Leu Gin Ala Asp Thr Lys Giu Gin 
210 215 220 

Lys Tyr Gin Arg lie Arg Asp Thr Lys lie Gly Thr Pro Leu Giu Val 
225 230 235 240 

Leu Cys Lys Gly Leu Pro Giu Clu Phe lie Thr Tyr Met Cys Tyr Thr 
245 250 255 

Arg Gin Leu Ser Phe Thr Giu Lys Pro Asn Tyr Ala Tyr Leu Arg Lys 
260 265 270 

Leu Phe Arg Asp Leu Leu He Arg Lys Gly Tyr Gin Tyr Asp Tyr Val 
275 280 285 

Phe Asp Trp Met He Leu Lys Tyr Gin Lys Arg Ala Ala Ala Ala Ala 
290 295 300 

Ala Ala Ser Ala Thr Ala Pro Pro Gin Val Thr Ser Pro Met Val Ser 
305 310 315 320 

Gin Thr Gin Pro Val Asn Pro He Thr Pro Asn Tyr Ser Ser He Pro 
325 330 335 

Leu Pro Ala Giu Arg Asn Pro Lys Thr Pro Gin Ser Phe Ser Thr Asn 
340 345 350 

He Val Gin Cys Ala Ser Pro Ser Pro Leu Pro Leu Ser Phe Arg Ser 
355 360 365 

Pro Val Pro Asn Lys Asp Tyr Giu Tyr He Pro Ser Ser Leu Gin Pro 
370 375 380 

Gin Tyr Ser Ala Gin Leu Arg Arg Val Leu Asp Giu Giu Pro Ala Pro 
385 390 395 400 

(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

<A) LENGTH: 1210 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: DNA (genomic) 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: Protein Kinase 

< ix ) FEATURE : 

(A) NAME/KEY: CDS 

(B) LOCATION: 173.. 1147 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 
GGCGGTGATC AGTTCCCCTC TGCTGATTCT GGGCCCGAAC CCGGTAAAGG CCTCCGTGTT 60 
CCGTTTCCTG CCGCCCTCCT CCGTAGCCTT GCCTAGTGTA GGAGCCCCGA GGCCTCCGTC 120 
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CTCTTCCCAG AGGTGTCGGG GCTTGCCCCA GCCTCCATCT TCGTCTCTCA GG ATG 17 5 

Met 
1 

GCG AGT AGC AGC GGC TCC AAG GCT GAA TTC ATT GTC GGA GGG AAA TAT 223 
Ala Ser Ser Ser Gly Ser Lys Ala Glu Phe lie Val Gly Gly Lys Tyr 
5 10 15 

AAA CTG GTA CGG AAG ATC GGG TCT GGC TCC TTC GGG GAC ATC TAT TTG 271 
Lys Leu Val Arg Lys lie Gly Ser Gly Ser Phe Gly Asp He Tyr Leu 
20 25 30 

GCG ATC AAC ATC ACC AAC GGC GAG GAA GTG GCA GTG AAG CTA GAA TCT 319 
Ala He Asn He Thr Asn Gly Glu Glu Val Ala Val Lys Leu Glu Ser 
35 40 45 

CAG AAG GCC AGG CAT CCC CAG TTG CTG TAG GAG AGC AAG CTC TAT AAG 367 
Gin Lys Ala Arg His Pro Gin Leu Leu Tyr Glu Ser Lys Leu Tyr Lys 
50 55 60 65 

ATT CTT CAA GGT GGG GTT GGC ATC CCC CAC ATA CGG TGG TAT GGT CAG 415 
He Leu Gin Gly Gly Val Gly He Pro His He Arg Trp Tyr Gly Gin 
70 75 80 

GAA AAA GAC TAG AAT GTA CTA GTC ATG GAT CTT CTG GGA CCT AGC CTC 463 
Glu Lys Asp Tyr Asn Val Leu Val Met Asp Leu Leu Gly Pro Ser Leu 
85 90 95 

GAA GAC CTC TTC AAT TTC TGT TCA AGA AGG TTC ACA ATG AAA ACT GTA 511 
Glu Asp Leu Phe Asn Phe Cys Ser Arg Arg Phe Thr Met Lys Thr Val 
100 105 110 

CTT ATG TTA GCT GAC CAG ATG ATC AGT AGA ATT GAA TAT GTG CAT ACA 559 
Leu Met Leu Ala Asp Gin Met He Ser Arg He Glu Tyr Val His Thr 
115 120 125 

AAG AAT TTT ATA CAC AGA GAC ATT AAA CCA GAT AAC TTC CTA ATG GGT 607 
Lys Asn Phe He His Arg Asp He Lys Pro Asp Asn Phe Leu Met Gly 
130 135 140 145 

ATT GGG CGT CAC TGT AAT AAG TTA TTC CTT ATT GAT TTT GGT TTG GCC 655 
He Gly Arg His Cys Asn Lys Leu Phe Leu He Asp Phe Gly Leu Ala 
150 155 160 

AAA AAG TAG AGA GAC AAC AGG ACA AGG CAA CAC ATA CCA TAC AGA GAA 703 
Lys Lys Tyr Arg Asp Asn Arg Thr Arg Gin His He Pro Tyr Arg Glu 
165 170 175 

GAT AAA AAC CTC ACT GGC ACT GCC GGA TAT GCT AGC ATC AAT GCA CAT 751 
Asp Lys Asn Leu Thr Gly Thr Ala Arg Tyr Ala Ser He Asn Ala His 
180 185 190 

CTT GGT ATT GAG CAG AGT CGC CGA GAT GAC ATG GAA TCA TTA GGA TAT 799 
Leu Gly He Glu Gin Ser Arg Arg Asp Asp Met Glu Ser Leu Gly Tyr 
195 200 205 

GTT TTG ATG TAT TTT AAT AGA ACC AGC CTG CCA TGG CAA GGG CTA AAG 847 
Val Leu Met Tyr Phe Asn Arg Thr Ser Leu Pro Trp Gin Gly Leu Lys 
210 215 220 225 

GCT GCA ACA AAG AAA CAA AAA TAT GAA AAG ATT AGT GAA AAG AAG ATG 895 
Ala Ala Thr Lys Lys Gin Lys Tyr Glu Lys He Ser Glu Lys Lys Met 
230 235 240 

TCC ACG CCT GTT GAA GTT TTA TGT AAG GGG TTT CCT GCA GAA TTT GCG 943 
Ser Thr Pro Val Glu Val Leu Cys Lys Gly Phe Pro Ala Glu Phe Ala 
245 250 255 
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ATG TAG TTA AAC TAT TGT CGT GGG CTA CGC TTT GAG GAA GCC CCA GAT 991 
Met Tyr Leu Asn Tyr Cys Arg Gly Leu Arg Phe Glu Glu Ala Pro Asp 
260 265 270 

TAG ATG TAT CTG AGG CAG CTA TTC CGC ATT CTT TTC AGG ACC CTG AAC 1039 
Tyr Met Tyr Leu Arg Gin Leu Phe Arg lie Leu Phe Arg Thr Leu Asn 
275 280 285 

CAT CAA TAT GAC TAG AC A TTT GAT TGG AC A ATG TTA AAG CAG AAA GCA 1087 
His Gin Tyr Asp Tyr Thr Phe Asp Trp Thr Met Leu Lys Gin Lya Ala 
290 295 300 305 

GCA CAG CAG GCA GCC TCT TCC AGT GGG CAG GGT CAG CAG GCC CAA ACC 1135 
Ala Gin Gin Ala Ala Ser Ser Ser Gly Gin Gly Gin Gin Ala Gin Thr 
310 315 320 

CCC ACA GGT TTC TAAGCATGAA TTGAGGAACA GAAGAAGCAG AGCAGATGAT 1187 
Pro Thr Gly Phe 
325 

CGAGCAGCAT TTGTTTCTCC CAA 1210 

(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 325 amino acids 

(B) TYPE: amino acid 
( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: protein 

(Xi) SEQXJENCE DESCRIPTION: SEQ ID NO: 8: 

Met Ala Ser Ser Ser Gly Ser Lys Ala Glu Phe lie Val Gly Gly Lys 
15 10 15 

Tyr Lys Leu Val Arg Lys lie Gly Ser Gly Ser Phe Gly Asp lie Tyr 
20 25 30 

Leu Ala lie Asn lie Thr Asn Gly Glu Glu Val Ala Val Lys Leu Glu 
35 40 45 

Ser Gin Lys Ala Arg His Pro Gin Leu Leu Tyr Glu Ser Lys Leu Tyr 
50 55 60 

Lys lie Leu Gin Gly Gly Val Gly lie Pro His lie Arg Trp Tyr Gly 
65 70 75 80 

Gin Glu Lys Asp Tyr Asn Val Leu Val Met Asp Leu Leu Gly Pro Ser 
85 90 95 

Leu Glu Asp Leu Phe Asn Phe Cys Ser Arg Arg Phe Thr Met Lys Thr 
100 105 110 

Val Leu Met Leu Ala Asp Gin Met lie Ser Arg lie Glu Tyr Val His 
115 120 125 

Thr Lys Asn Phe lie His Arg Asp lie Lys Pro Asp Asn Phe Leu Met 
130 135 140 

Gly lie Gly Arg His Cys Asn Lys Leu Phe Leu lie Asp Phe Gly Leu 
145 150 155 160 

Ala Lys Lys Tyr Arg Asp Asn Arg Thr Arg Gin His lie Pro Tyr Arg 
165 170 175 



BNSDOCID: <WO 9417189A2_I_> 



wo 94/17189 



PCT/US94/00795 



-79- 

GIu Asp Lys Asn Leu- Thr Gly Thr Ala Arg Tyr Ala Ser lie Asn Ala 
180 185 190 

His Leu Gly lie Glu Gin Ser Arg Arg Asp Asp Met Glu Ser Leu Gly 
195 200 205 

Tyr Val Leu Met Tyr Phe Asn Arg Thr Ser Leu Pro Trp Gin Gly Leu 
210 215 220 

Lys Ala Ala Thr Lys Lys Gin Lys Tyr Glu Lys lie Ser Glu Lys Lys 
225 230 235 240 

Met Ser Thr Pro Val Glu Val Leu Cys Lys Gly Phe Pro Ala Glu Phe 
245 250 255 

Ala Met Tyr Leu Asn Tyr Cys Arg Gly Leu Arg Phe Glu Glu Ala Pro 
260 265 270 

Asp Tyr Met Tyr Leu Arg Gin Leu Phe Arg lie Leu Phe Arg Thr Leu 
275 280 285 

Asn His Gin Tyr Asp Tyr Thr Phe Asp Trp Thr Met Leu Lys Gin Lys 
290 295 300 

Ala Ala Gin Gin Ala Ala Ser Ser Ser Gly Gin Gly Gin Gin Ala Gin 
305 310 315 320 

Thr Pro Thr Gly Phe 
325 

(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2385 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 2 97., 1388 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

GAATTCCGAT AGTATTATGT GGAGTTCCAT TTTTATGTAT TTTTTGTATG AAATATTCTA 60 

GTATAAGTAA ATATTTTATC AGAAGTATTT ACATATCTTT TTTTTTTTTA GTTTGAGAGC 120 

GGCGGTGATC AGGTTCCCCT CTGCTGATTC TGGGCCCCGA ACCCCGGTAA AGGCCTCCGT 180 

GTTCCGTTTC CTGCCGCCCT CCTCCGTAGC CTTGCCTAGT GTAGGAGCCC CGAGGCCTCC 240 

GTCCTCTTCC CAGAGGTGTC GGGGCTTGGC CCCAGCCTCC ATCTTCGTCT CTCAGG 296 

ATG GCG AGT AGC AGO GGC TCC AAG GCT GAA TTC ATT GTC GGA GGG AAA 344 
Met Ala Ser Ser Ser Gly Ser Lys Ala Glu Phe He Val Gly Gly Lys 
15 10 15 

TAT AAA CTG GTA CGG AAG ATC GGG TCT GGC TCC TTC GGG GAC ATC TAT 392 
Tyr Lys Leu Val Arg Lys He Gly Ser Gly Ser Phe Gly Asp He Tyr 
20 25 30 
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TTG GCG ATC AAC ATC ACC AAC GGC GAG GAA GTG GCA GTG AAG CTA GAA 440 
Leu Ala lie Aan lie Thr Asn Gly Glu Glu Val Ala Val Lys Leu Glu 
35 40 45 

TCT CAG AAG GCC AGG CAT CCC CAG TTG CTG TAG GAG AGC AAG CTC TAT 488 
Ser Gin Lys Ala Arg His Pro Gin Leu Leu Tyr Glu Ser Lys Leu Tyr 
50 55 60 

AAG ATT CTT CAA GGT GGG GTT GGC ATC CCC CAC ATA CGG TGG TAT GGT 536 
Lys lie Leu Gin Gly Gly Val Gly lie Pro His lie Arg Trp Tyr Gly 
65 70 75 80 

CAG GAA AAA GAC TAC AAT GTA CTA GTC ATG GAT CTT CTG GGA CCT AGC 584 
Gin Glu Lys Asp Tyr Asn Val Leu Val Met Asp Leu Leu Gly Pro Ser 
85 90 95 

CTC GAA GAC CTC TTC AAT TTC TGT TCA AGA AGG TTC AC A ATG AAA ACT 632 
Leu Glu Asp Leu Phe Asn Phe Cys Ser Arg Arg Phe Thr Met Lys Thr 
100 105 110 

GTA CTT ATG TTA GCT GAC CAG ATG ATC ACT AGA ATT GAA TAT GTG CAT 680 
Val Leu Met Leu Ala Asp Gin Met lie Ser Arg lie Glu Tyr Val His 
115 120 125 

ACA AAG AAT TTT ATA CAC AGA GAC ATT AAA CCA GAT AAC TTC CTA ATG 728 
Thr Lys Asn Phe lie His Arg Asp lie Lya Pro Asp Asn Phe Leu Met 
130 135 140 

GGT ATT GGG CGT CAC TGT AAT AAG TGT TTA GAA TCT CCA GTG GGG AAG 776 
Gly lie Gly Arg His Cys Asn Lys Cys Leu Glu Ser Pro Val Gly Lys 
145 ISO 155 160 

AGG AAA AGA AGC ATG ACT GTT AGT ACT TCT CAG GAC CCA TCT TTC TCA 824 
Arg Lys Arg Ser Met Thr Val Ser Thr Ser Gin Asp Pro Ser Phe Ser 
165 170 175 

GGA TTA AAC CAG TTA TTC CTT ATT GAT TTT GGT TTG GCC AAA AAG TAC 872 
Gly Leu Asn Gin Leu Phe Leu lie Asp Phe Gly Leu Ala Lys Lys Tyr 
180 185 190 

AGA GAC AAC AGG ACA AGG CAA CAC ATA CCA TAC AGA GAA GAT AAA AAC 920 
Arg Asp Asn Arg Thr Arg Gin His lie Pro Tyr Arg Glu Asp Lys Asn 
195 200 205 

CTC ACT GGC ACT GCC CGA TAT GCT AGC ATC AAT GCA CAT CTT GGT ATT 968 
Leu Thr Gly Thr Ala Arg Tyr Ala Ser lie Asn Ala His Leu Gly lie 
210 215 220 

GAG CAG AGT GGC CGA GAT GAC ATG GAA TCA TTA GGA TAT GTT TTG ATG 1016 
Glu Gin Ser Arg Arg Asp Asp Met Glu Ser Leu Gly Tyr Val Leu Met 
225 230 235 240 

TAT TTT AAT AGA ACC AGC CTG CCA TGG CAA GGG CTA AAG GGT GCA ACA 1064 
Tyr Phe Asn Arg Thr Ser Leu Pro Trp Gin Gly Leu Lys Ala Ala Thr 
245 250 255 

AAG AAA CAA AAA TAT GAA AAG ATT AGT GAA AAG AAG ATG TCC ACG CCT 1112 
Lys Lys Gin Lys Tyr Glu Lys lie Ser Glu Lys Lys Met Ser Thr Pro 
260 265 270 

GTT GAA GTT TTA TGT AAG GGG TTT CCT GCA GAA TTT GCG ATG TAC TTA 1160 
Val Glu Val Leu Cys Lys Gly Phe Pro Ala Glu Phe Ala Met Tyr Leu 
275 280 285 

AAC TAT TGT CGT GGG CTA CGC TTT GAG GAA GCC CCA GAT TAC ATG TAT 1208 
Asn Tyr Cys Arg Gly Leu Arg Phe Glu Glu Ala Pro Asp Tyr Met Tyr 
290 295 300 
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CTG AGG CAG CTA TTC CGC ATT CTT TTC AGG ACC CTG AAC CAT CAA TAT 1256 

Leu Arg Gin Leu Phe Arg lie Leu Phe Arg Thr Leu Asn His Gin Tyr 

305 310 315 320 

GAG TAG ACA TTT GAT TGG ACA ATG TTA AAG CAG AAA GCA GCA CAG CAG 1304 
Asp Tyr Thr Phe Asp Trp Thr Met Leu Lys Gin Lys Ala Ala Gin Gin 
325 330. 335 

GCA GCC TCT TCC AGT GGG CAG GGT CAG CAG GCC CAA ACC CCC ACA GGC 1352 
Ala Ala Ser Ser Ser Gly Gin Gly Gin Gin Ala Gin Thr Pro Thr Gly 
340 345 350 

AAG CAA ACT GAC AAA ACC AAG AGT AAC ATG AAA GGT TAGTAGCCAA 1398 
Lys Gin Thr Asp Lys Thr Lys Ser Asn Met Lys Gly 
355 360 



GAACCAAGTG 


ACGTTACAGG 


GAAAAAATTG 






AX X X ^X AAwA 


^ ACQ 


GTG T TAG AT C 


AAGGAGGTGG 


T TTTAAAAT A 






X X Aiu\iuulAA 


imp 
X 3X a 


AAAAGACGTC 


CTTGGAAAAT 


TTGACTACTA 


■fi /^•T»»i»n' n & A r^f^ 
AC-l 1 i.iiAAL.L.' 




#p rp rp M TV a 2 

X X X OA X A X A 


X 3 / O 


TATGTATATG 


TATTTGTATA 


TACATATATG 


TGTGTATATT 


TATATCATTT 


CTCTTGGGAT 


1638 


TTTGGGTCAT 


TTTTTTAACA 


ACTGCATCTT 


TTTTACTCAT 


TCATTAACCC 


CCTTTCCAAA 


1698 


AATTTGGTGT 


TGGGAATATA 


ATATAATCAA 


TCAATCCAAA 


ATCCTAGACC 


TAACACTTGT 


1758 


TGATTTCTAA 


TAATGAATTT 


GGTTAGCCAT 


ATTTTGACTT 


TATTTCAGAC 


TAACAATGTT 


1818 


AAGATTTTTT 


ATTTTGCATG 


TTAATGCTTT 


AGCATTTAAA 


ATGGAAAATT 


GTGAACATGT 


1878 


TGTAATTTCA 


AGAGGTGAGT 


TTGGCATTAC 


CCCCAAAGTG 


TCTATCTTCT 


CAGTTGCAGA 


1938 


GCATCTCATT 


TTCTCTCTTA 


AATGCTCAAA 


TAAATGCAAA 


GCTCAGCACA 


TCTTTTCTAG 


1998 


TCACAAAAAT 


AATTCTTTTA 


TTTGCAGTTT 


ACGTATGATC 


TTAATTTCAA 


AACGATTTCT 


2058 


TTGTTTTTGG 


CTTGATTTTT 


CACAATGTTG 


CAAATATCAG 


GCTCCCAGGG 


TTTAATGTGG 


2118 


AATTGAAGTC 


TGCAGCCAGG 


CCTTGCAAAT 


TGAAGGTAAC 


TGGGGCAAAT 


GCCATTGAAA 


2178 


CCGCTAGTCT 


TATTTCCTTT 


CTACTTTTCT 


TTGGCACTCT 


TACTGCCTGT 


AAGGAGTAGA 


2238 


ACTGTTAAGG 


CACACTGTTG 


CTATACAGTT 


AACTCCCATT 


TTCATGTTTT 


GTCTTTCTTT 


2298 


TCCCATTTCT 


GGGGCTTACC 


TCCTGATACC 


TGCTTACTTT 


CTGGAAGTAG 


TGGGCAAGTA 


2358 


AGATTT6GCT 


CTTGGTTTCT 


GGAATTC 








2385 



(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 364 amino acids 

(B) TYPE: cunino acid 
<D) TOPOLOGY: linear 

(ii) MOLECXJLE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

Met Ala Ser Ser Ser Gly Ser Lys Ala Glu Phe lie Val Gly Gly Lys 
1 5 10 15 

Tyr Lys Leu Val Arg Lys lie Gly Ser Gly Ser Phe Gly Asp lie Tyr 
20 25 30 
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Leu Ala lie Asn He Thr Asn Gly Glu Glu Val Ala Val Lys Leu Glu 
35 40 45 

Ser Gin Lys Ala Arg His Pro Gin Leu Leu Tyr Glu Ser Lvs Leu Tvr 
50 55 60 

Lys lie Leu Gin Gly Gly Val Gly He Pro His He Arg Trp Tyr Gly 
65 70 75 80 

Gin Glu Lys Asp Tyr Asn Val Leu Val Met Asp Leu Leu Gly Pro Ser 
85 90 95 

Leu Glu Asp Leu Phe Asn Phe Cys Ser Arg Arg Phe Thr Met Lys Thr 
100 105 110 

Val Leu Met Leu Ala Asp Gin Met He Ser Arg He Glu Tyr Val His 
115 120 125 

Thr Lys Asn Phe He His Arg Asp He Lys Pro Asp Asn Phe Leu Met 
130 135 140 

Gly He Gly Arg His Cys Asn Lys Cys Leu Glu Ser Pro Val Gly Lvs 
145 ISO 155 160 

Arg Lys Arg Ser Met Thr Val Ser Thr Ser Gin Asp Pro Ser Phe Ser 
165 170 175 

Gly Leu Asn Gin Leu Phe Leu He Asp Phe Gly Leu Ala Lys Lys Tyr 
180 185 190 

Arg Asp Asn Arg Thr Arg Gin His He Pro Tyr Arg Glu Asp Lys Asn 
195 200 205 

Leu Thr Gly Thr Ala Arg Tyr Ala Ser He Asn Ala His Leu Gly He 
210 215 220 

Glu Gin Ser Arg Arg Asp Asp Met Glu Ser Leu Gly Tyr Val Leu Met 
225 230 235 240 

Tyr Phe Asn Arg Thr Ser Leu Pro Trp Gin Gly Leu Lys Ala Ala Thr 
245 250 255 

Lye Lys Gin Lys Tyr Glu Lys He ser Glu Lys Lys Met Ser Thr Pro 
260 265 270 

Val Glu Val Leu Cys Lys Gly Phe Pro Ala Glu Phe Ala Met Tyr Leu 
275 280 285 

Asn Tyr Cys Arg Gly Leu Arg Phe Glu Glu Ala Pro Asp Tyr Met Tvr 
290 295 300 

Leu Arg Gin Leu Phe Arg He Leu Phe Arg Thr Leu Asn His Gin Tvr 
305 310 315 320 

Asp Tyr Thr Phe Asp Trp Thr Met Leu Lys Gin Lys Ala Ala Gin Gin 
325 330 335 

Ala Ala Ser Ser Ser Gly Gin Gly Gin Gin Ala Gin Thr Pro Thr Gly 
340 345 350 

Lys Gin Thr Asp Lys Thr Lys Ser Asn Met Lys Gly 
355 360 
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(2) INFORMATION FOR SEQ ID NO: 11: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2914 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

{ ix ) FEATURE : 

(A) NAME/KEY: CDS 

(B) LOCATION: 2 65.. 1275 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

GAATTCCCGA GAAACAAGTG GCCCAGCCTG GTAACCGCCG AGAAGCCCTT CACAAACTGC 60 

GGCCTGGCAA AAAGAAACCT GACTGAGCGG CGGTGATCAG GTTCCCCTCT GCTGATTCTG 120 

GGCCCCGAAC CCCGGTAAAG GCCTCCGTGT TCCGTTTCCT GCCGCCCTCC TCCGTAGCCT 180 

TGCCTAGTGT AGGAGCCCCG AGGCCTCCGT CCTCTTCCCA GAGGTGTCGG GGCTTGGCCC 240 

CAGCCTCCAT CTTCGTCTCT CAGG ATG GCG AGT AGC AGC GGC TCC AAG GCT 291 

Met Ala ser Ser Ser Gly ser Lys Ala 
1 5 

GAA TTC ATT GTC GGA GGG AAA TAT AAA CTG GTA CGG AAG ATC GGG TCT 339 
Glu Phe lie Val Gly Gly Lys Tyr Lys Leu Val Arg Lys lie Gly Ser 
10 15 20 25 

GGC TCC TTC GGG GAC ATC TAT TTG GCG ATC AAC ATC ACC AAC GGC GAG 387 

Gly Ser Phe Gly Asp lie Tyr Leu Ala lie Asn lie Thr Asn Gly Glu 

30 35 40 

* 

GAA GTG GCA GTG AAG CTA GAA TCT CAG AAG GCC AGG CAT CCC CAG TTG 435 
Glu Val Ala Val Lys Leu Glu Ser Gin Lys Ala Arg His Pro Gin Leu 
45 50 55 

CTG TAC GAG AGC AAG CTC TAT AAG ATT CTT CAA GGT GGG GTT GGC ATC 483 
Leu Tyr Glu Ser Lys Leu Tyr Lys lie Leu Gin Gly Gly Val Gly lie 
60 65 70 

CCC CAC ATA CGG TGG TAT GGT CAG GAA AAA GAC TAC AAT GTA CTA GTC 531 
Pro His lie Arg Trp Tyr Gly Gin Glu Lys Asp Tyr Asn Val Leu Val 
75 80 85 

ATG GAT CTT CTG GGA CCT AGC CTC GAA GAC CTC TTC AAT TTC TGT TCA 579 
Met Asp Leu Leu Gly Pro Ser Leu Glu Asp Leu Phe Asn Phe Cys Ser 
90 95 100 105 

AGA AGG TTC ACA ATG AAA ACT GTA CTT ATG TTA GCT GAC CAG ATG ATC 627 
Arg Arg Phe Thr Met Lys Thr Val Leu Met Leu Ala Asp Gin Met lie 
110 115 120 

AGT AGA ATT GAA TAT GTG CAT ACA AAG AAT TTT ATA CAC AGA GAC ATT 675 
Ser Arg lie Glu Tyr Val His Thr Lys Asn Phe lie His Arg Asp lie 
125 130 135 

AAA CCA GAT AAC TTC CTA ATG GGT ATT GGG CGT CAC TGT AAT AAG TTA 723 
Lys Pro Asp Asn Phe Leu Met Gly lie Gly Arg His Cys Asn Lys Leu 
140 145 150 
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TTC CTT ATT GAT TTT GGT TTG GCC AAA AAG TAG AGA GAG AAC AGG ACA 771 

Phe Leu lie Asp Phe Gly Leu Ala Lys Lys Tyr Arg Asp Asn Arg Thr 
155 160 165 

AGG CAA CAC ATA CCA TAG AGA GAA GAT AAA AAC CTC ACT GGC ACT GCC 819 

Arg Gin His lie Pro Tyr Arg Glu Asp Lys Asn Leu Thr Gly Thr Ala 
170 175 180 185 

CGA TAT GCT AGC ATC AAT GCA CAT CTT GGT ATT GAG CAG AGT CGC CGA 867 

Arg Tyr Ala Ser lie Asn Ala His Leu Gly lie Glu Gin Ser Arg Arg 
190 195 200 

GAT GAC ATG GAA TCA TTA GGA TAT GTT TTG ATG TAT TTT AAT AGA ACC 915 

Asp Asp Met Glu Ser Leu Gly Tyr Val Leu Met Tyr Phe Asn Arg Thr 

205 210 215 

AGC CTG CCA TGG CAA GGG CTA AAG GCT GCA ACA AAG AAA CAA AAA TAT 963 

Ser Leu Pro Trp Gin Gly Leu Lys Ala Ala Thr Lys Lys Gin Lys Tyr 
220 225 230 

GAA AAG ATT AGT GAA AAG AAG ATG TCC ACG CCT GTT GAA GTT TTA TGT 1011 

Glu Lys lie Ser Glu Lys Lys Met Ser Thr Pro Val Glu Val Leu Cys 
235 240 245 

AAG GGG TTT CCT GCA GAA TTT GCG ATG TAG TTA AAC TAT TGT CGT GGG 1059 

Lys Gly Phe Pro Ala Glu Phe Ala Met Tyr Leu Asn Tyr Cys Arg Gly 
250 255 260 265 

CTA CGC TTT GAG GAA GCC CCA GAT TAG ATG TAT CTG AGG CAG CTA TTC 1107 

Leu Arg Phe Glu Glu Ala Pro Asp Tyr Met Tyr Leu Arg Gin Leu Phe 
270 275 280 

CGC ATT CTT TTC AGG ACC CTG AAC CAT CAA TAT GAC TAC ACA TTT GAT 1155 

Arg lie Leu Phe Arg Thr Leu Asn His Gin Tyr Asp Tyr Thr Phe Asp 

285 290 295 

TGG ACA ATG TTA AAG CAG AAA GCA GCA CAG CAG GCA GCC TCT TCC AGT 1203 

Trp Thr Met Leu Lys Gin Lys Ala Ala Gin Gin Ala Ala Ser Ser Ser 
300 305 310 

GGG CAG GGT CAG CAG GCC C7VA ACC CCC ACA GGC AAG CAA ACT GAC AAA 1251 

Gly Gin Gly Gin Gin Ala Gin Thr Pro Thr Gly Lys Gin Thr Asp Lys 
315 320 325 



ACC AAG AGT AAC ATG AAA GGT TTC TAAGCATGAA TTGAGGAACA GAAGAAGCAG 
Thr Lys Ser Asn Met Lys Gly Phe 
330 335 


1305 


AGCAGATGAT 


CGGAGCAGCA 


TTTGTTTCTC 


CCCAAATCTA 


GAAATTTTAG 


TTCATATGTA 


1365 


CACTAGCCAG 


TGGTTGTGGA 


CAACCATTTA 


CTTGGTGTAA 


AGAACTTAAT 


TTCAGTATAA 


1425 


ACTGACTCTG 


GGCAGCATTG 


GTGATGCTGT 


ATCCTGAGTT 


GTAGCCTCTG 


TAATTGTGAA 


1485 


TATTAACTGA 


GATAGTGAAA 


CATGGTGTCC 


GGTTTTCTAT 


T6CATTTTTT 


CAAGTGGAAA 


1545 


AGTTAACTAA 


ATGGTTGACA 


CACAAAAATT 


GGTGGAGAAA 


TTGTGCATAT 


GCCAATTTTT 


1605 


TGTTAAAACC 


TTTTGTTTTG 


AACTATACTG 


CTTTGAGATC 


TCATTTCAGA 


AGAACGGCAT 


1665 


GAACAGTCTT 


CAGCCACAGT 


TGTGATGGTT 


GTTAAATGCT 


CACAATTGTG 


CATTCTTAGG 


1725 


GTTTTT CC AT 


CCCTGGGGTT 


TGCAAGTTGT 


TCACTTAAAA 


CATTCTTAAA 


ATGGTTGGCT 


1785 


TCTTGTCTGC 


AAGCCAGCTG 


ATATGGTAGC 


AACCAAAGAT 


TCCAGTGTTT 


GAGCATATGA 


1845 


AAGACTCTGC 


CTGCTTAATT 


GTGCTAGAAA 


TAACAGCATC 


TAAAGTGAAG 


ACTTAAGAAA 


1905 
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AACTTAGTGA CTACTAGATT ATCCTTAGGA CTCTGCATTA ACTCTATAAT GTTCTTGGTA 1965 

TTAAAAAAAA AGCATATTTG TCACAGAAAT TTAGTTAACA TCTTACAACT GAACATGTAT 2025 

GTATGTTGCT TAGATAAATG TAATCACTGT AAACATCTAT ATGATCTGGG ATTTTGTTTT 2085 

TATTTTGAAA TGGGAGCTTT TTTGTTTACA AGTTCATTAA AAACTAAAAA CTGTTTCTGT 2145 

AAGGAAATGA GATTTTTTTT AAACAACAAA AAATGCCTTG CTGACTCACT ATTAAATAAA 2205 

AATCTCCCCA ATTTTTTGAT AGACTACTTC AAGCCATTTG TTACATGGTA TTCCTTTGCA 2265 

AGTCAATTTA GGTTTCGTGT TATAACTTTT CCTCTTTTTT TAAGAAAAAT GAAAAAAGTA 2325 

ATTCTTTTGT CTGAAGGGGA AAGGCATTCT TTCATTTTTT TCTTTTTTTT ttTTTTTTTT 2385 

TTATGACTTG CAGGCACAAT ATCTAGTACT GCAACTGCCA GAACTTGGTA TTGTAGCTGC 2445 

TGCCCGCTGA CTAGCAGCTG GACTGATTTT GAATAAAAAT GAAAGCAGTA CTGGGATTAC 2505 

AGGTGAGCCA CAGTGCCTGG CCCTTTTTTG TTTTTATTGT CTGTCTCCCC ACTAGAAGGT 2565 

ACGCTCTACA AGGGCAGGGA TTTGTGCATC TTATTCATAG TGTTTCCCAC GTGGCAGATG 2625 

CTCACTAAAG ATTTCAAAGG AGAAACTGTG ATGGACTCGT TCTGTAGATG AGAGAACAGA 2685 

GGCACAGAGA CCTGTCCATG GTCCCCTGGC AGAAGGAGGT GGGGTCTGGA TTCCACCCCA 2745 

GGGCTGCGTG GCTGCAGGAC CTCAGTGCTT GACTCCACAC TGCTGAGGGC TGTGAGTCCC 2805 

TGGCCAGCCC AGACACAGTC CTGCAGCCCA GGCTGAGCAT TCTCAGACCT TCATGGAGAT 2865 

GCCCACTCTC CTGTGAGCCT CCTGCTTCCT TTGCCCAGGG CCGGAATTC 2914 

(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 337 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12 : 

Met Ala Ser Ser Ser Gly Ser Lys Ala Glu Phe lie Val Gly Gly Lys 
15 10 15 

Tyr Lys Leu Val Arg Lys lie Gly Ser Gly Ser Phe Gly Asp lie Tyr 
20 25 30 

Leu Ala lie Asn lie Thr Asn Gly Glu Glu Val Ala Val Lys Leu Glu 
35 40 45 

Ser Gin Lys Ala Arg His Pro Gin Leu Leu Tyr Glu Ser Lys Leu Tyr 
50 55 60 

Lys lie Leu Gin Gly Gly Val Gly He Pro His He Arg Trp Tyr Gly 
65 70 75 80 

Gin Glu Lys Asp Tyr Asn Val Leu Val Met Asp Leu Leu Gly Pro Ser 
85 90 95 

Leu Glu Asp Leu Phe Asn Phe Cys Ser Arg Arg Phe Thr Met Lys Thr 
100 105 110 
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Val Leu Met Leu Ala Asp Gin Met He Ser Arg He Glu Tyr Val His 
115 120 125 

Thr Lys Asn Phe lie His Arg Asp He Lys Pro Asp Asn Phe Leu Met 
130 135 140 

Giy lie Giy Arg His Cys Asn Lys Leu Phe Leu He Asp Phe Gly Leu 
145 ISO 155 160 

Ala Lys Lys Tyr Arg Asp Asn Arg Thr Arg Gin His He Pro Tyr Arg 
165 170 175 

Glu Asp Lys Asn Leu Thr Gly Thr Ala Arg Tyr Ala Ser He Asn Ala 
180 185 190 

His Leu Gly He Glu Gin Ser Arg Arg Asp Asp Met Glu Ser Leu Gly 
195 200 205 

Tyr Val Leu Met Tyr Phe Asn Arg Thr Ser Leu Pro Trp Gin Gly Leu 
210 215 220 

Lys Ala Ala Thr Lys Lys Gin Lys Tyr Glu Lys He Ser Glu Lys Lys 
225 230 235 240 

Met Ser Thr Pro Val Glu Val Leu Cys Lys Gly Phe Pro Ala Glu P^e 
245 250 255 

Ala Met Tyr Leu Asn Tyr Cys Arg Gly Leu Arg Phe Glu Glu Ala Pro 
260 265 270 

Asp Tyr Met Tyr Leu Arg Gin Leu Phe Arg He Leu Phe Arg Thr Leu 
275 280 285 

Asn His Gin Tyr Asp Tyr Thr Phe Asp Trp Thr Met Leu Lys Gin Lys 
290 295 300 

Ala Ala Gin Gin Ala Ala Ser Ser Ser Gly Gin Gly Gin Gin Ala Gin 
305 310 315 320 

Thr Pro Thr Gly Lys Gin Thr Asp Lys Thr Lys Ser Asn Met Lys Gly 
325 330 335 

Phe 



(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRTVNDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(vii) IMMEDIATE SOURCE: 

(B) CLONE: Protein Kinase 



( ix ) FEATURE : 

(A) NAME/KEY: CDS 

(B) LOCATION: 1 . . 23 

(D) OTHER INFORMATION: /note= "Bases designated N at 
positi ns 3, 6, 9, 12 and 18 are Inosine." 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO:13r 
GGNWSNGGNW SNTTYGGNGA YAT 23 
(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(vii) IMMEDIATE SOURCE: 

(B) CLONE: Protein Kinase 

( ix ) FEATURE : 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..23 

(D) OTHER INFORMATION: /note= "Bases designated N at 
positions 6, 12 and 18 are Inosine." 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 
CAYGMNGAYA TNAARCCNGA YAA 23 
(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(vii) IMMEDIATE SOURCE: 

(B) CLONE: Protein Kinase 

( ix ) FEATURE : 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..24 

(D> OTHER INFORMATION: /note= "Bases designated N at 
positions 7, 13 and 19 are Inosine," 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 
RTTRTCNGGY TTNATRTCNC KRTG 24 
(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(vii) IMMEDIATE SOURCE: 

(B) CLONE: Protein Kinase 
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( ix ) FEATURE : 

<A) NAME/KEY: CDS 
(B) LOCATION: 1..18 

(D) OTHER INFORMATION: /note= "Bases designated N at 
positions 1, 4, 7 and 13 are Inosine," 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 
NCCNARNSWy TCNARRTC 18 
(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TVPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(vii) IMMEDIATE SOURCE: 

(B) CLONE: Protein Kinase 

( ix ) FEATURE : 

(A) NAME/KEY: CDS 
<B) LOCATION: 1.-20 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 
ATATAAACTG GTACGGAAGA 20 
(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(vii) IMMEDIATE SOURCE: 

(B) CLONE: Protein Kinase 

( ix ) FEATURE : 

(A) NAME/KEY: CDS 

(B) LOCATION: l.,17 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 
ACATACGGTG GTATGGT 17 
(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
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(vii) IMMEDIATE SOURCE: 

(B) CLONE: Protein Kinase 

( ix ) FEATURE : 

(A) NAME/KEY: CDS 

(B) LOCATION: 1, . 19 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:19: 
ATGACATGGA ATCATTAGG 19 
(2) INFORMATION PGR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(vii) IMMEDIATE SOURCE: 

(B) CLONE: Protein Kinase 

( ix ) FEATURE : 

(A) NAME/KEY: CDS 

(B) LOCATION: 1 . . 19 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 
CCTAATGATT CCATGTCAT 19 



(2) INFORMATION FOR SEQ ID NO: 21: 



(i) SEQUENCE CHARACTERISTICS: 
<A) LENGTH: IS base pairs 
<B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(vii) IMMEDIATE SOURCE: 

(B) CLONE: Protein Kinase 

( ix ) FEATURE : 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..18 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 
TCAGGTACAT GTAATCCG 18 
(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 39 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
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(vii) IMMEDIATE SOURCE: 

(B) CLONE: Protein Kinase 

( ix ) FEATURE : 

(A) NAME/KEY: CDS 

(B) LOCATION: 1 . . 39 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22 : 
CCTGATCGAT TCCAGCCTGA TCGCTACTTC TTCACCACT 39 
(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3627 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(vii) IMMEDIATE SOURCE: 

(B) CLONE: Protein Kinase 

( ix ) FEATURE ; 

(A) NAME/KEY: CDS 

(B) LOCATION: 1633,.3204 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 



GATCAGATGA 


TATAGCTTTT 


TGTGTGCCGT 


ACCTTTCCGC 


GATTCTGCCC 


GTATATCTTG 


60 


GTCCCTGAGC 


TATTTTCTGA 


GATTCTTTTT 


GTTGCTTTGC 


CAAATCATTG 


GCGTCATTCA 


120 


TGGTCATACC 


AAATCCCAAT 


TTGGCAAACT 


TGGGTGTTAA 


AGTATCTTGC 


TGTTCTTTT C 


180 


TAGTTGTGTC 


GAAGCTGTTT 


GAAGTGTCAT 


TTAAAAAATC 


ATTGAATTCA 


TCAGGCTGGG 


240 


TATTAATATC 


ATCTATACTG 


TTATTATTGT 


TGCCTTTACT 


GTTATTCATA 


AATTGGGAAT 


300 


CGTAATCATT 


TGTCTAATTT 


TGGTGCTAGA 


AGACGAATTA 


GTGAACTCGT 


CCTCCTTTTC 


360 


TTGTTGAGCC 


TCTTTTTTAA 


ATTGATCAAA 


CAAGTCTTCT 


GCCTGTGATT 


TGTCGACTTT 


420 


CTTTGCGGTT 


AGTCTAGTGG 


GCTTTCTTGA 


CGAAGACAAA 


ATTGAATGTT 


TCTTTTTATC 


480 


TTGCGAGTTT 


AATACCGGTT 


TCTTTCTGCA 


TGCCGTTAAG 


ATGGAACTCT 


CGTTTTAGTG 


540 


ACAGTGGTCT 


TGGGTGTGCT 


GCCTGTGGTG 


TTGTTTTTTG 


GGGCGAGAGA 


GCCTGTATTT 


600 


ACATTGAGTT 


TAGAACTGGA 


ATTGGAGCTT 


GGTTTTTGCC 


AATTAGAGAA 


AAAATCGTCA 


660 


ACACTATTTT 


CTTTGGAAGT 


CGACCTGGAA 


GCGTCTGAAT 


CGGTGTCCAA 


CGGTGAGTCC 


720 


GAAGAATCTT 


GACCGTTCAA 


GACTAATTCT 


GATGGGTATA 


ACTCCATATC 


CTTTTGAACC 


780 


TTCTTGTCGA 


GATGTATCTT 


ATATTTCTTA 


GCAACAGGGC 


TCGTATATTT 


TGTTTTCGCG 


840 


TCAACATTTG 


CTGTATTTAG 


TAGCTGTTTC 


CCATTGTTCT 


TTAAGAAAAA 


ATCACGAGCC 


900 


TTATGGTTCC 


CACCCAACTT 


AAACCTTCTT 


AAATTGTTAA 


TTGTCCATTT 


ATCTAATGTA 


960 


GAAGACTTTA 


CAAAGGTGAT 


ATGAACACCC 


ATGTTTCTAT 


GCACAGCAGA 


GCATTGAATA 


1020 
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CACAGCATCA CACCAAAAGG TACCGAAGTC CAGTAGGATT CTTGTTACCA CAATCAAAAC 1080 

AAACTCGATT TTCCATGTTG CTACCTAGCT TCTGAAAAAC TTGTTGAGTA GTCTGTTCCG 1140 

TGGCAAATGT TTCTCCTTCA TCGTTACTCA TTGTCGCTAT GTGTATACTA AATTGCTCAA 1200 

GAAGACCGGA TCAACAAGTA CTTAACAAAT ACCCTTTCTT TGCTATCGCC TTGATCTCCT 1260 

TTTATAAAAT GCCAGCTAAA TCGTGTTTAC GAAGAATAGT TGTTTTCTTT tTTTTTTTTT 1320 

TTTTTCGAAA CTTTACCGTG TCGTCGAAAA TGACCAAACG ATGTTACTTT TCCTTTTGTG 1380 

TCATAGATAA TACCAATATT GAAAGTAAAA TTTTAAACAT TCTATAGGTG AATTGAAAAG 1440 

GGCAGCTTAG AGAGTAACAG GGGAACAGCA TTCGTT^CAT CTAGGTACTG GTATTATTTG 1500 

CTGTTTTTTA AAAAAGAAGG AAATCCGTTT TGCAAGAATT GTCTGCTATT TAAGGGTATA 1560 

CGTGCTACGG TCCACTAATC AAAAGTGGTA TCTCATTCTG AAGAAAAAGT GTAAAAAGGA 1620 

CGATAAGGAA AG ATG TCC CAA CGA TCT TCA CAA CAC ATT GTA GGT ATT 1668 
Met Ser Gin Arg Ser Ser Gin His lie Val Gly lie 
15 10 

CAT TAT GOT GTA GGA OCT AAG ATT GGC GAA GGG TCT TTC GGA GTA A.TA 1716 
His Tyr Ala Val Gly Pro Lys lie Gly Glu Gly Ser Phe Gly Val lie 
15 20 25 

TTT GAG GGA GAG AAC ATT CTT CAT TCT TGT CAA GCG CAG ACC GGT AGC 17 64 

Phe Glu Gly Glu Asn lie Leu His Ser Cys Gin Ala Gin Thr Gly Ser 
30 35 40 

AAG AGG GAC TCT AGT ATA ATA ATG GCG AAC GAG CCA GTC GCA ATT AAA 1812 
Lys Arg Asp Ser Ser lie lie Met Ala Asn Glu Pro Val Ala lie Lys 
45 50 55 60 

TTC GAA CCG CGA CAT TCG GAC GCA CCC CAG TTG CGT GAC GAA TTT AG A 1860 
Phe Glu Pro Arg His Ser Asp Ala Pro Gin Leu Arg Asp Glu Phe Arg 
65 70 75 

GCC TAT AGG ATA TTG AAT GGC TGC GTT GGA ATT CCC CAT GCT TAT TAT 1908 
Ala Tyr Arg lie Leu Asn Gly Cys Val Gly lie Pro His Ala Tyr Tyr 
80 85 90 

TTT GGT CAA GAA GGT ATG CAC AAC ATC TTG ATT ATC GAT TTA CTA GGG 1956 
Phe Gly Gin Glu Gly Met His Asn lie Leu lie lie Asp Leu Leu Gly 
95 100 105 

CCA TCA TTG GAA GAT CTC TTT GAG TGG TGT GGT AG A AAA TTT TCA GTG 2004 
Pro Ser Leu Glu Asp Leu Phe Glu Trp Cys Gly Arg Lys Phe Ser Val 
110 115 120 

AAA ACA ACC TGT ATG GTT GCC AAG CAA ATG ATT GAT AGA GTT AGA GCA 2052 
Lys Thr Thr Cys Met Val Ala Lys Gin Met lie Asp Arg Val Arg Ala 
125 130 135 140 

ATT CAT GAT CAC GAC TTA ATC TAT CGC GAT ATT AAA CCC GAT AAC TTT 2100 
lie His Asp His Asp Leu lie Tyr Arg Asp lie Lys Pro Asp Asn Phe 
145 150 155 

TTA ATT TCT CAA TAT CAA AGA ATT TCA CCT GAA GGA AAA GTC ATT AAA 2148 
Leu lie Ser Gin Tyr Gin Arg lie Ser Pro Glu Gly Lys Val lie Lys 
160 165 170 

TCA TGT GCC TCC TCT TCT AAT AAT GAT CCC AAT TTA ATA TAC ATG GTT 2196 
Ser Cys Ala Ser Ser Ser Asn Asn Asp Pro Asn Leu lie Tyr Met Val 
175 180 185 
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GAC TTT GGT ATG GCA AAA CAA TAT AGA GAT CCA AG A ACG AAA CAA CAT 2244 

Asp Phe Gly Met Ala Lys Gin Tyr Arg Asp Pro Arg Thr Lys Gin His 

190 195 200 

ATA CCA TAG CGT GAA CGA AAA TCA TTG AGC GGT ACC GCC AGA TAT ATG 2292 
lie Pro Tyr Arg Glu Arg Lys Ser Leu Ser Gly Thr Ala Arg Tyr Met 
205 210 215 220 

TCT ATT AAT ACT CAT TTT GGA AGA GAA CAG TCA CGT AGG GAT GAT TTA 2340 
Ser lie Asn Thr His Phe Gly Arg Glu Gin Ser Arg Arg Asp Asp Leu 
225 230 235 

GAA TCG CTA GGT CAC GTT TTT TTT TAT TTC TTG AGG GGA TCC TTG CCA 2388 
Glu Ser Leu Gly His Val Phe Phe Tyr Phe Leu Arg Gly Ser Leu Pro 
240 245 250 

TGG CAA GGT TTG AAA GCA CCA AAC AAC AAA CTG AAG TAT GAA AAG ATT 243 6 

Trp Gin Gly Leu Lys Ala Pro Asn Asn Lys Leu Lys Tyr Glu Lys lie 
255 260 265 

GGT ATG ACT AAA CAG AAA TTG AAT CCT GAT GAT CTT TTA TTG AAT AAT 2484 
Gly Met Thr Lys Gin Lys Leu Asn Pro Asp Asp Leu Leu Leu Asn Asn 
270 275 280 

GCT ATT CCT TAT CAG TTT GCC ACA TAT TTA AAA TAT GCA CGT TCC TTG 2532 
Ala lie Pro Tyr Gin Phe Ala Thr Tyr Leu Lys Tyr Ala Arg Ser Leu 
285 290 295 300 

AAG TTC GAC GAA GAT CCG GAT TAT GAC TAT TTA ATC TCG TTA ATG GAT 2580 
Lys Phe Asp Glu Asp Pro Asp Tyr Asp Tyr Leu lie Ser Leu Met Asp 
305 310 315 

GAC GCT TTG AGA TTA AAC GAC TTA AAG GAT GAT GGA CAC TAT GAC TGG 2628 
Asp Ala Leu Arg Leu Asn Asp Leu Lys Asp Asp Gly His Tyr Asp Trp 
320 325 330 

ATG GAT TTG AAT GGT GGT AAA GGC TGG AAT ATC AAG ATT AAT AGA AGA 2676 
Met Asp Leu Asn Gly Gly Lys Gly Trp Asn lie Lys lie Asn Arg Arg 
335 340 345 

GCT AAC TTG CAT GGT TAC GGA AAT CCA AAT CCA AGA GTC AAT GGC AAT 2724 
Ala Asn Leu His Gly Tyr Gly Asn Pro Asn Pro Arg Val Asn Gly Asn 
350 355 360 

ACT GCA AGA AAC AAT GTG AAT ACG AAT TCA AAG ACA CGA AAT ACA ACG 2772 
Thr Ala Arg Asn Asn Val Asn Thr Asn Ser Lys Thr Arg Asn Thr Thr 
365 370 375 380 

CCA GTT GCG ACA CCT AAG CAA CAA GCT CAA AAC AGT TAT AAC AAG GAC 2820 
Pro Val Ala Thr Pro Lys Gin Gin Ala Gin Asn Ser Tyr Asn Lys Asp 
385 390 395 

AAT TCG AAA TCC AGA ATT TCT TCG AAC CCG CAG AGC TTT ACT AAA CAA 2868 
Asn ser Lys ser Arg lie Ser ser Asn Pro Gin Ser Phe Thr Lys Gin 
400 405 410 

CAA CAC GTC TTG AAA AAA ATC GAA CCC AAT AGT AAA TAT ATT CCT GAA 2916 
Gin His Val Leu Lys Lys lie Glu Pro Asn Ser Lys Tyr lie Pro Glu 
415 420 425 

ACA CAT TCA AAT CTT CAA CGG CCA ATT AAA AGT CAA AGT CAA ACG TAC 2964 
Thr His Ser Asn Leu Gin Arg Pro lie Lys Ser Gin Ser Gin Thr Tyr 
430 435 440 

GAC TCC ATC AGT CAT ACA CAA AAT TCA CCA TTT GTA CCA TAT TCA AGT 3012 
Asp Ser lie Ser His Thr Gin Asn Ser Pro Phe Val Pro Tyr Ser Ser 
445 450 455 460 
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TCT AAA GCT AAC CCT AAA AGA AGT AAT AAT GAG CAC AAC TTA CCA AAC 3060 

Ser Lys Ala Asn Pro Lys Arg Ser Asn Asn Glu His Asn Leu Pro Asn 
465 470 475 

CAC TAC AC A AAC CTT GCA AAT AAG AAT ATC AAT TAT CAA AGT CAA CGA 3108 

His Tyr Thr Asn Leu Ala Asn Lys Asn lie Asn Tyr Gin Ser Gin Arg 
480 485 490 

AAT TAC GAA CAA GAA AAT GAT GCT TAT TCT GAT GAC GAG AAT GAT ACA 3156 

Asn Tyr Glu Gin Glu Asn Asp Ala Tyr Ser Asp Asp Glu Asn Asp Thr 
495 500 505 

TTT TGT TCT AAA ATA TAC AAA TAT TGT TGT TGC TGT TTT TGT TGC TGT 3204 

Phe Cys Ser Lys lie Tyr Lys Tyr Cys Cys Cys Cys Phe Cys Cys Cys 

510 515 520 



TGATAAAGCG 


ATTTTTATAC 


TTTTCTCTTT 


TTCCTTTTTT 


TTTTTGATTG 


GCTGTTTCCT 


3264 


TATGCCGCTC 


TTTCCCAATT 


TATGACTTTC 


CAATAATGTA 


TTATTTTGTT 


TCTCTTTCTC 


3324 


TCTGTTACCC 


TTTATTTTAT 


CATCTACAAT 


AATTGAATTC 


CGGAGAGGGT 


AAAGAAACAG 


3384 


GAAAAAGAAG 


AAAATGAGAC 


ATAGTCAGCA 


TCGTAATCGT 


TTTCCTTCTG 


TATATTCCTT 


3444 


TATCAAAAGA 


CTACACGCAC 


ATATATATTA 


ATCCCGGTAT 


GTTTTTGGTG 


TGCTAAATCT 


3504 


ATCTTCAAGC 


ACTATTATAG 


CATTTTTTTA 


AGAATATCCA 


AAATAATATG 


TAATTTATGA 


3564 


TTAATCAAGG 


TTCAAGAATT 


GGAGAAACCG 


TGAGCGACTT 


CTTTGATACT 


TGGATGTAAG 


3624 


CTT 












3627 



(2) INFORMATION FOR SEQ ID NO: 24: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 524 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 

Met Ser Gin Arg Ser Ser Gin His He Val Gly He His Tyr Ala Val 
1 5 10 15 

Gly Pro Lys He Gly Glu Gly Ser Phe Gly Val He Phe Glu Gly Glu 
20 25 30 

Asn He Leu His Ser Cys Gin Ala Gin Thr Gly Ser Lys Arg Asp Ser 
35 40 45 

Ser He He Met Ala Asn Glu Pro Val Ala He Lys Phe Glu Pro Arg 
50 55 60 

His Ser Asp Ala Pro Gin Leu Arg Asp Glu Phe Arg Ala Tyr Arg He 
65 70 75 80 

Leu Asn Gly Cys Val Gly He Pro His Ala Tyr Tyr Phe Gly Gin Glu 
85 90 95 

Gly Met His Asn He Leu He He Asp Leu Leu Gly Pro Ser Leu Glu 
100 105 110 

Asp Leu Phe Glu Trp Cys Gly Arg Lys Phe Ser Val Lys Thr Thr Cys 
115 120 125 
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Met Val Ala Lys Gin Met lie Asp Arg Val Arg Ala lie His Asp His 
130 135 140 

Asp Leu lie Tyr Arg Asp lie Lys Pro Asp Asn Phe Leu lie Ser Gin 
145 150 155 160 

Tyr Gin Arg lie Ser Pro Glu Gly Lys Val lie Lys Ser Cys Ala Ser 
165 170 175 

Ser Ser Asn Asn Asp Pro Asn Leu lie Tyr Met Val Asp Phe Gly Met 
180 185 190 

Ala Lys Gin Tyr Arg Asp Pro Arg Thr Lys Gin His lie Pro Tyr Arg 
195 200 205 

Glu Arg Lys Ser Leu ser Gly Thr Ala Arg Tyr Met Ser lie Asn Thr 
210 215 220 

His Phe Gly Arg Glu Gin Ser Arg Arg Asp Asp Leu Glu Ser Leu Gly 
225 230 235 240 

His Val Phe Phe Tyr Phe Leu Arg Gly Ser Leu Pro Trp Gin Gly Leu 
245 250 255 

Lys Ala Pro Asn Asn Lys Leu Lys Tyr Glu Lys lie Gly Met Thr Lys 
260 265 270 

Gin Lys Leu Asn Pro Asp Asp Leu Leu Leu Asn Asn Ala lie Pro Tyr 
275 280 285 

Gin Phe Ala Thr Tyr Leu Lys Tyr Ala Arg Ser Leu Lys Phe Asp Glu 
290 295 300 

Asp Pro Asp Tyr Asp Tyr Leu lie Ser Leu Met Asp Asp Ala Leu Arg 
305 310 315 320 

Leu Asn Asp Leu Lys Asp Asp Gly His Tyr Asp Trp Met Asp Leu Asn 

325 330 335 

Gly Gly Lys Gly Trp Asn lie Lys lie Asn Arg Arg Ala Asn Leu His 
340 345 350 

Gly Tyr Gly Asn Pro Asn Pro Arg Val Asn Gly Asn Thr Ala Arg Asn 
355 360 365 

Asn Val Asn Thr Asn Ser Lys Thr Arg Asn Thr Thr Pro Val Ala Thr 
370 375 380 

Pro Lys Gin Gin Ala Gin Asn Ser Tyr Asn Lys Asp Asn Ser Lys Ser 
385 390 395 400 

Arg lie Ser Ser Asn Pro Gin Ser Phe Thr Lys Gin Gin His Val Leu 
405 410 415 

Lys Lys lie Glu Pro Asn Ser Lys Tyr lie Pro Glu Thr His Ser Asn 
420 425 430 

Leu Gin Arg Pro lie Lys Ser Gin Ser Gin Thr Tyr Asp Ser lie Ser 
435 440 445 

His Thr Gin Asn Ser Pro Phe Val Pro Tyr' Ser Ser Ser Lys Ala Asn 
450 455 460 

Pro Lys Arg Ser Asn Asn Glu His Asn Leu Pro Asn His Tyr Thr Asn 
465 470 475 480 
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Leu Ala- Asn Lys Asn lie Asn Tyr Gin Ser Gin Arg Asn Tyr Glu Gin 
485 490 495 

Glu Asn Asp Ala Tyr Ser Asp Asp Glu Asn Asp Thr Phe Cys Ser Lys 
500 505 510 

lie Tyr Lys Tyr Cys Cys Cys Cys Phe Cys Cys Cys 
515 520 

(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : 1 inear 



(ii) MOLECULE TYPE: peptide 



(vii) IMMEDIATE SOURCE: 

(B) CLONE: Protein Kinase 

( ix ) FEATURE : 

(A) NAME /KEY: Peptide 

(B) LOCATION: 1..6 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 

Gly Pro Ser Leu Glu Asp 
1 5 

(2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: peptide 



(vii) IMMEDIATE SOURCE: 

(B) CLONE: Protein Kinase 

( ix ) FEATURE : 

(A) NAME/KEY: Peptide 

(B) LOCATION: l.,9 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 

Arg Asp lie Lys Pro Asp Asn Phe Leu 

1 5 

(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

( C ) STRANDEDNESS : single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: peptide 
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(vii) IMMEDIATE SOURCE: 

(B) CLONE: Protein Kinase 

( ix ) FEATURE : 

(A) NAME/KEY: Peptide 

(B) LOCATION: l.,6 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 

His He Pro Tyr Arg Glu 
1 5 



(2) INFORMATION FOR SEQ ID NO: 28: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(ix) FEATURE: 

(A) NAME/KEY: Modif ied-site 

(B) LOCATION: 6 

(C) OTHER INFORMATION: /note= "The nucleotide at this position 
is inosine." 

(ix) FEATURE: 

(A) NAME/KEY: Mcdif ied-site 

(B) LOCATION: 9 

(C) OTHER INFORMATION: /note= "The nucleotide at this position 
is inosine." 

( ix ) FEATURE : 

(A) NAME/KEY: Modif ied-site 

(B) LOCATION: 12 

(C) OTHER INFORMATION: /note= "The nucleotide at this position 
is inosine," 

( ix ) FEATURE : 

(A) NAME/KEY: Modif ied-site 

(B) LOCATION: 15 

(C) OTHER INFORMATION: /note= "The nucleotide at this position 
is inosine." 

( ix ) FEATURE : 

(A) NAME/KEY: Modif ied-site 

(B) LOCATION: 21 

(C) OTHER INFORMATION: /note= "The nucleotide at this position 
is inosine." 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28: 

GARYTNMGNY TNGGNAAYYT N 21 

(2) INFORMATION FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: eingl 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
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( ix ) FEATURE : 

(A) NAME/KEY: Modif ied-site 

(B) LOCATION: 9 

(C) OTHER INFORMATION: /note= "The nucleotide at this position 
is inosine." 

<ix) FEATURE: 

(A) NAME/KEY: Modif ied-site 

(B) LOCATION: 12 

(C) OTHER INFORMATION: /note= "The nucleotide at this position 
is inosine," 

( ix ) FEATURE : 

(A) NAME/KEY: Modif ied-site 

(B) LOCATION: 15 

(C) OTHER INFORMATION: /note= "The nucleotide at this position 
is inosine." 

( ix ) FEATURE : 

(A) NAME/KEY: Modif ied-site 

(B) LOCATION: 18 

(C) OTHER INFORMATION: /note= "The nucleotide at this position 
is inosine," 

( ix ) FEATURE : 

(A) NAME/KEY: Modif ied-site 

(B) LOCATION: 21 

(C) OTHER INFORMATION: /note= "The nucleotide at this position 
is inosine," 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29: 
GTYTTRTTNC CNGGNCKNCC NAT 23 

(2) INFORMATION FOR SEQ ID NO: 30: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2405 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

( ix ) FEATURE : 

(A) NAME/KEY: CDS 

(B) LOCATION: 67 •.1197 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 

AAAGTGGAGT ACCGCAAACT TGATATGGAA AATAAAAAGA AAGACAAGGA CAAATCAGAT 60 

GATAGA ATG GCA CGA CCT AGT GGT CGA TCG GGA CAC AAC ACT CGA GGA 108 
Met Ala Arg Pro Ser Gly Arg Ser Gly His Asn Thr Arg Gly 
1 5 10 
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ACT GGG TCT TCA TCG TCT GGA GTT TTA ATG GTT GGA CCT AAC TTT AGA 156 

Thr Gly Ser Ser Ser Ser Gly Val Leu Met Val Gly Pro Asn Phe Arg 
15 20 25 30 

GTT GGA AAA AAA ATT GGA TGT GGC AAT TTT GGA GAA TTA CGA TTA GGG 204 
Val Gly Lys Lys lie Gly Cys Gly Asn Phe Gly Glu Leu Arg Leu Gly 
35 40 45 

AAA AAT TTA TAG AC A AAT GAA TAT GTG GGA ATT AAG TTG GAG CCC ATG 252 
Lys Asn Leu Tyr Thr Asn Glu Tyr Val Ala lie Lys Leu Glu Pro Met 
50 55 60 

AAA TCA AGA GCA CCA CAG CTA CAT TTG GAA TAG AGA TTC TAT AAG CAG 300 
Lys Ser Arg Ala Pro Gin Leu His Leu Glu Tyr Arg Phe Tyr Lys Gin 
65 70 75 

TTA GGA TCT GGA GAT GGT ATA CCT CAA GTT TAC TAT TTC GGC CCC TGT 348 
Leu Gly Ser Gly Asp Gly lie Pro Gin Val Tyr Tyr Phe Gly Pro Cys 
80 85 90 

GGT AAA TAC AAT GCT ATG GTG CTG GAA CTG CTG GGA CCT AGT TTG GAA 396 
Gly Lys Tyr Asn Ala Met val Leu Glu Leu Leu Gly Pro Ser Leu Glu 
95 100 105 110 

GAC TTG TTT GAC TTG TGT GAC AGA ACA TTT TCT CTT AAA ACA GTT CTC 444 
Asp Leu Phe Asp Leu Cys Asp Arg Thr Phe Ser Leu Lys Thr Val Leu 
115 120 125 

ATG ATA GCT ATA CAA CTG ATT TCT CGC ATG GAA TAT GTC CAT TCA AAG 492 
Met lie Ala lie Gin Leu He Ser Arg Met Glu Tyr Val His Ser Lys 
130 135 140 

AAC TTG ATA TAC AGA GAT GTA AAA CCT GAG AAC TTC TTA ATA GGA CGA 540 
Asn Leu He Tyr Arg Asp Val Lys Pro Glu Asn Phe Leu lie Gly Arg 
145 150 155 

CCA GGA AAC AAA ACC CAG CAA GTT ATT CAC ATT ATA GAT TTT GGT TTG 588 
Pro Gly Asn Lys Thr Gin Gin Val He His He He Asp Phe Gly Leu 
160 165 170 

GCA AAG GAA TAT ATT GAT CCG GAG ACA AAG AAA CAC ATA CCA TAC AGA 636 
Ala Lys Glu Tyr He Asp Pro Glu Thr Lys Lys His He Pro Tyr Arg 
175 180 185 190 

GAA CAC AAG AGC CTT ACA GGA ACA GCT AGA TAT ATG AGC ATA AAC ACA 684 
Glu His Lys Ser Leu Thr Gly Thr Ala Arg Tyr Met Ser He Asn Thr 
195 200 205 

CAT TTA GGA AAA GAA CAA AGT AGA AGA GAC GAT TTA GAA GCT TTA GGT 732 
His Leu Gly Lys Glu Gin Ser Arg Arg Asp Asp Leu Glu Ala Leu Gly 
210 215 220 

CAT ATG TTC ATG TAT TTT CTG AGA GGC AGT CTT CCT TGG CAA GGC TTA 780 
His Met Phe Met Tyr Phe Leu Arg Gly Ser Leu Pro Trp Gin Gly Leu 
225 230 235 

AAG GCT GAC ACA TTA AAG GAG AGG TAT CAG AAA ATT GGA GAT ACA AAA 828 
Lys Ala Asp Thr Leu Lys Glu Arg Tyr Gin Lys He Gly Asp Thr Lys 
240 245 250 

CGG GCT ACA CCA ATA GAA GTG TTA TGT GAA AAT TTT CCA GAA GAA ATG 87 6 

Arg Ala Thr Pro He Glu Val Leu Cys Glu Asn Phe Pro Glu Glu Met 
255 260 265 270 

GCA ACA TAT CTT CGT TAT GTA AGA AGG CTA GAT TTT TTT GAA AAA CCA 924 
Ala Thr Tyr Leu Arg Tyr Val Arg Arg Leu Asp Phe Phe Glu Lys Pro 
275 280 285 
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GAC TAT GAC TAC TTA AGA AAG CTT TTT ACT GAC TTG TTT GAT CGA AAA 972 
Asp Tyr Asp Tyr Leu Arg Lys Leu Phe Thr Asp Leu Phe Asp Arg Lys 
290 295 300 

GGA TAT ATG TTT GAT TAT GAA TAT GAC TGG ATT GOT AAA CAG TTG CCT 1020 
Gly Tyr Met Phe Asp Tyr Glu Tyr Asp Trp lie Gly Lys Gin Leu Pro 
305 310 315 

ACT CCA GTG GGT GCA GTT CAG CAA GAT CCT GCT CTG TCA TCA AAC AGA 1068 
Thr Pro Val Gly Ala Val Gin Gin Asp Pro Ala Leu Ser Ser Asn Arg 
320 325 330 

GAA GCA CAT CAA CAC AGA GAT AAG ATG CAA CAA TCC AAA AAC CAG GTT 1116 
Glu Ala His Gin His Arg Asp Lys Met Gin Gin Ser Lys Asn Gin Val 
335 340 345 350 

GTA AGT TCT AC A AAT GGA GAG TTA AAC ACA GAT GAC CCC ACC GCA GAC 1164 
Val Ser Ser Thr Asn Gly Glu Leu Asn Thr Asp Asp Pro Thr Ala Asp 
355 360 365 

GTT CAA ATG CAC CCA TCA CAG CCC CTA CTG AAG TAGAAGTGAT GGATGAAACC 1217 
Val Gin Met His Pro Ser Gin Pro Leu Leu Lys 
370 375 

AACTGCCAGA AAGTGTTGAA CATGTGGTGC TGCTGTTTTT TCAAACGAAG GAAAAGGAAA 1277 

ACCATACAGC GCCACAAATG ACTCTGGACA CAGACAGATC CTGGGGAGTT ACTTACATGT 1337 

TCATCTGCTG TCTTGTGATT AAAATCATCT CTGTAGTGAC CACGTATATT TTCAAGGACT 1397 

CACTCTTAGA AACAAAAATG TCATACTTTC ATACTTCATT TTGTGGTTGT CTTACATTCT 1457 

TTTTCTTTTT TTTTTTCTCT AATTTAACCT TTATGGAAGC TTTAAAGTTT TGTCAAAAAC 1517 

ATGAGTGCTT TTGCCCCATC AGTGAATGGA ATGGACCAAT GAGGTGGTAT CAATGAATAT 1577 

AGTTCCATAG AACATTTCCA GAAGTTCTTC TGTTGTAGAA AGCAGTACAG TATCTTAAGT 1637 

GTCAACCAGT TATATACCTA ATCTGGTTTT TTATAACTTC TGTAAGAGCA TAATCAAACA 1697 

GGAATTTTCT TTTCTCAGTG GATAATACAA CAGAGAAAAC AGAGTTGCCC AAATATTTAA 1757 

AAGAAGTTAT TCCTTGAGAA GTTCATATTT TGTGACATCT GCATTGATTT CAGTATTACT 1817 

GATGGTACTG TTATTCATAA GTCATATTAA CATTCTCTCC GTGAAATCAT GGTACAGTCG 1877 

CTGCCCAGAG GTACTGAGGA AAAAGCAATA TGGGTTCGGC AGATGGTGGT GGTAAAATGA 1937 

ATCTTAAGGA GTGTGGTAAA TATGCGTCCG CTTTTGTTGC ATCACTATGT GAAGTACTGT 1997 

GTTGCAGAAG TGGCAAAAGC GCTTATTTTT AAAAATGCAA AATATTTGTA CAATGTAACT 2057 

TTATGCTTCC AAATAATAAT GTATGTTAGA CAGCAAGAAA TGAATACTTT AAAAAGTGAT 2117 

GTATGTTGGA GTTATAAAGA AATACACTAA GGAGAGGTAG TAAATGTGAA CCTTGTTGCA 2177 

GTGTATAAGG TGGAAGCCTA AAGAAATCTC ACCGAAACTT ACTGCTGAAT GATTACATTC 2237 

TCCCTTAAGC AGAAAACTTT GGATGTGCCA TGCAATGGTG TCTGTGTAAT TATTTTGCTC 2297 

TTTGATTAAA AAAAAGACCC CCAGCAATAA AAAGTGGGTC ACTCTAAAAA AAAAAAAAAA 2357 

AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA ACGACAGCAA CGGAATTC 2405 
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(2) INFORMATION FOR SEQ ID NO:31: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 377 amino acids 

(B) TYPE: amino acid 
( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31: 

Met Ala Arg Pro Ser Gly Arg Ser Gly His Asn Thr Arg Gly Thr Gly 
15 10 15 

Ser Ser Ser Ser Gly Val Leu Met Val Gly Pro Asn Phe Arg Val Gly 
20 25 30 

Lye Lys lie Gly Cys Gly Asn Phe Gly Glu Leu Arg Leu Gly Lys Asn 
35 40 45 

Leu Tyr Thr Asn Glu Tyr Val Ala lie Lye Leu Glu Pro Met Lys Ser 
50 55 60 

Arg Ala Pro Gin Leu His Leu Glu Tyr Arg Phe Tyr Lys Gin Leu Gly 
65 70 75 80 

ser Gly Asp Gly He Pro Gin Val Tyr Tyr Phe Gly Pro Cys Gly Lys 
85 90 95 

Tyr Asn Ala Met Val Leu Glu Leu Leu Gly Pro Ser Leu Glu Asp Leu 
100 105 110 

Phe Asp Leu Cys Asp Arg Thr Phe Ser Leu Lys Thr Val Leu Met He 
115 120 125 

Ala He Gin Leu He Ser Arg Met Glu Tyr Val His Ser Lys Asn Leu 
130 135 140 

He Tyr Arg Asp Val Lys Pro Glu Asn Phe Leu He Gly Arg Pro Gly 
145 150 155 160 

Asn Lys Thr Gin Gin Val He His He He Asp Phe Gly Leu Ala Lys 
165 170 175 

Glu Tyr He Asp Pro Glu Thr Lys Lys His He Pro Tyr Arg Glu His 
180 185 190 

Lys Ser Leu Thr Gly Thr Ala Arg Tyr Met Ser He Asn Thr His Leu 
195 200 205 

Gly Lys Glu Gin Ser Arg Arg Asp Asp Leu Glu Ala Leu Gly His Met 
210 215 220 

Phe Met Tyr Phe Leu Arg Gly Ser Leu Pro Trp Gin Gly Leu Lys Ala 
225 230 235 240 

Asp Thr Leu Lys Glu Arg Tyr Gin Lys He Gly Asp Thr Lys Arg Ala 
245 250 255 

Thr Pro He Glu Val Leu Cys Glu Asn Phe Pro Glu Glu Met Ala Thr 
260 265 270 

Tyr Leu Arg Tyr Val Arg Arg Leu Asp Phe Phe Glu Lys Pro Asp Tyr 
275 280 285 

Asp Tyr Leu Arg Lys Leu Phe Thr Asp Leu Phe Asp Arg Lys Gly Tyr 
290 295 300 
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Met Ph Asp Tyr Giu Tyr Asp Trp He Gly Lys Gin Leu Pro Thr Pro 
305 310 315 320 

Val Gly Ala Val Gin Gin Asp Pro Ala Leu Ser Ser Asn Arg Glu Ala 
325 330 335 

His Gin His Arg Asp Lys Met Gin Gin Ser Lys Asn Gin Val Val Ser 
340 345 350 

Ser Thr Asn Gly Glu Leu Asn Thr Asp Asp Pro Thr Ala Asp Val Gin 
355 360 365 

Met His Pro Ser Gin Pro Leu Leu Lys 
370 375 



(2) INFORMATION FOR SEQ ID NO: 32: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1233 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

. (ii) MOLECULE TYPE: cDNA 



( ix ) FEATURE : 

(A) NAME /KEY: CDS 

( B ) LOCATION : 1 . . 1041 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32: 

AGA GTT GGA AAA AAA ATT GGA TGT GGC AAT TTT GGA GAA TTA CGA TTA 48 

Arg Val Gly Lys Lys lie Gly Cys Gly Asn Phe Gly Glu Leu Arg Leu 

15 10 15 

GGG AAA AAT TTA TAC ACA AAT GAA TAT GTG OCA ATT AAG TTG GAG CCC 96 

Gly Lys Asn Leu Tyr Thr Asn Glu Tyr Val Ala lie Lys Leu Glu Pro 

20 25 30 

ATG AAA TCA AGA GCA CCA CAG CTA CAT TTG GAA TAC AGA TTC TAT AAG 144 

Met Lys Ser Arg Ala Pro Gin Leu His Leu Glu Tyr Arg Phe Tyr Lys 
35 40 45 

CAG TTA GGA TCT GGA GAT GGT ATA CCT CAA GTT TAC TAT TTC GGC CCT 192 

Gin Leu Gly Ser Gly Asp Gly lie Pro Gin Val Tyr Tyr Phe Gly Pro 
50 55 60 

TGT GGT AAA TAC AAT GOT ATG GTG CTG GAA CTG CTG GGA CCT AGT TTG 240 

Cys Gly Lys Tyr Asn Ala Met Val Leu Glu Leu Leu Gly Pro Ser Leu 

65 70 75 80 

GAA GAC TTG TTT GAC TTG TGT GAC AGA ACA TTT TCT CTT AAA ACA GTT 288 

Glu Asp Leu Phe Asp Leu Cys Asp Arg Thr Phe Ser Leu Lys Thr Val 

85 90 95 

CTC ATG ATA GCT ATA CAA CTG ATT TCT CGC ATG GAA TAT GTC CAT TCA 336 

Leu Met lie Ala lie Gin Leu lie Ser Arg Met Glu Tyr Val His Ser 

100 105 110 

AAG AAC TTG ATA TAC AGA GAT GTA AAA CCT GAG AAC TTC TTA ATA GGA 384 

Lye Asn Leu lie Tyr Arg Asp Val Lys Pro Glu Asn Phe Leu lie Gly 
115 120 125 
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CGA CCA GGA AAC AAA ACC CAG CAA GTT ATT CAC ATT ATA GAT TTT GGT 432 
Arg Pro Gly Asn Lys Thr Gin Gin Val lie His lie lie Asp Phe Gly 
130 135 140 

TTG GCA AAG GAA TAT ATT GAT CCG GAG ACA AAG AAA CAC ATA CCA TAC 480 
Leu Ala Lys Glu Tyr lie Asp Pro Glu Thr Lys Lys His lie Pro Tyr 
145 150 155 160 

AGA GAA CAC AAG AGC CTT ACA GGA ACA GCT AG A TAT ATG AGC ATA AAC 528 
Arg Glu His Lys Ser Leu Thr Gly Thr Ala Arg Tyr Met Ser lie Asn 
165 170 175 

ACA CAT TTA GGA AAA GAA CAA AGT AGA AGA GAC GAT TTA GAA GCT TTA 57 6 

Thr His Leu Gly Lys Glu Gin Ser Arg Arg Asp Asp Leu Giu Ala Leu 
180 185 190 

GGT CAT ATG TTC ATG TAT TTT CTG AGA GGC AGT CTT CCT TGG CAA GGC 624 
Gly His Met Phe Met Tyr Phe Leu Arg Gly Ser Leu Pro Trp Gin Gly 
195 200 205 

TTA AAG GTT GAC ACA TTA AAG GAG AGG TAT CAG AAA ATT GGA GAT ACA 672 
Leu Lys Val Asp Thr Leu Lys Glu Arg Tyr Gin Lys lie Gly Asp Thr 
210 215 220 

AAA CGG GCT ACA CCA ATA GAA GTG TTA TGT GAA AAT TTT CCA GAA ATG 72 0 

Lys Arg Ala Thr Pro lie Glu Val Leu Cys Glu Asn Phe Pro Glu Met 
225 230 235 240 

GCA ACA TAT CTT CGT TAT GTA AGA AGG CTA GAT TTT TTT GAA AAA CCA 768 
Ala Thr Tyr Leu Arg Tyr Val Arg Arg Leu Asp Phe Phe Glu Lys Pro 
245 250 255 

GAC TAT GAC TAC TTA AGA AAG CTT TTT ACT GAC TTG TTT GAT CGA AAA 816 
Asp Tyr Asp Tyr Leu Arg Lys Leu Phe Thr Asp Leu Phe Asp Arg Lys 
260 265 270 

GGA TAT ATG TTT GAT TAT GAA TAT GAC TGG ATT GGT AAA CAG TTG CCT 864 
Gly Tyr Met Phe Asp Tyr Glu Tyr Asp Trp lie Gly Lys Gin Leu Pro 
275 280 285 

ACT CCA GTG GGT GCA GTT CAG CAA GAT CCT GCT CTG TCA TCA AAC AGA 912 
Thr Pro Val Gly Ala Val Gin Gin Asp Pro Ala Leu Ser Ser Asn Arg 
290 295 300 

GAA GCA CAT CAA CAC AGA GAT AAG ATG CAA CAA TCC AAA AAC CAG GTT 960 
Glu Ala His Gin His Arg Asp Lys Met Gin Gin Ser Lys Asn Gin Val 
305 310 315 320 

GTA AGT TCT ACA AAT GGA GAG TTA AAC ACA GAT GAC CCC ACC GCA GAC 1008 
Val Ser Ser Thr Asn Gly Glu Leu Asn Thr Asp Asp Pro Thr Ala Asp 
325 330 335 

GTT CAA ATG CAC CCA TCA CAG CCC CTA CTG AAG TAGAAGTGAT GGATG/U^CC 1061 
Val Gin Met His Pro Ser Gin Pro Leu Leu Lys 
340 345 

AACTGCCAGA AAGTGTTGAA CATGTGGTGC TGCTGTTTTT TCAAACGAAG GAAAAGGAAA 1121 

ACCATACAGC GCCACAAATG ACTCTGGACA CAGACAGATC CTGGGGAGTT ACTTACATGT 1181 

TCATCTGCTG TCTTGTGATT AAATCATCTC TGTAGTGACC ACGTATATTT TC 1233 
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(2) INFORMATION FOR SEQ ID NO: 33: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 347 amino acids 
<B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33: 

Arg Val Gly Lys Lys lie Gly Cys Gly Asn Phe Gly Glu Leu Arg Leu 
15 10 15 

Gly Lys Asn Leu Tyr Thr Asn Glu Tyr Val Ala lie Lys Leu Glu Pro 
20 25 30 

Met Lys Ser Arg Ala Pro Gin Leu His Leu Glu Tyr Arg Phe Tyr Lys 
35 40 45 

Gin Leu Gly Ser Gly Asp Gly lie Pro Gin Val Tyr Tyr Phe Gly Pro 
50 55 60 

Cys Gly Lys Tyr Asn Ala Met Val Leu Glu Leu Leu Gly Pro Ser Leu 
65 70 75 80 

Glu Asp Leu Phe Asp Leu Cys Asp Arg Thr Phe Ser Leu Lys Thr Val 
85 90 95 

Leu Met lie Ala lie Gin Leu lie Ser Arg Met Glu Tyr Val His Ser 
100 105 110 

Lys Asn Leu lie Tyr Arg Asp Val Lys Pro Glu Asn Phe Leu lie Gly 
115 120 125 

Arg Pro Gly Asn Lys Thr Gin Gin Val lie His He He Asp Phe Gly 
130 135 140 

Leu Ala Lys Glu Tyr He Asp Pro Glu Thr Lys Lys His He Pro Tyr 
145 150 155 160 

Arg Glu His Lys Ser Leu Thr Gly Thr Ala Arg Tyr Met Ser He Asn 
165 170 175 

Thr His Leu Gly Lys Glu Gin Ser Arg Arg Asp Asp Leu Glu Ala Leu 
180 185 190 

Gly His Met Phe Met Tyr Phe Leu Arg Gly Ser Leu Pro Trp Gin Gly 
195 200 205 

Leu Lys Val Asp Thr Leu Lys Glu Arg Tyr Gin Lys He Gly Asp Thr 
210 215 220 

Lys Arg Ala Thr Pro He Glu Val Leu Cys Glu Asn Phe Pro Glu Met 
225 230 235 240 

Ala Thr Tyr Leu Arg Tyr Val Arg Arg Leu Asp Phe Phe Glu Lys Pro 
245 250 255 

Asp Tyr Asp Tyr Leu Arg Lys Leu Phe Thr Asp Leu Phe Asp Arg Lys 
260 265 270 

Gly Tyr Met Phe Asp Tyr Glu Tyr Asp Trp He Gly Lys Gin Leu Pro 
275 280 285 

Thr Pro Val Gly Ala Val Gin Gin Asp Pro Ala Leu Ser Ser Asn Arg 
290 295 300 
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Giu Ala His Gin His Arg Asp Lvs Met Gin Gin Ser Lys Asn Gin Val 
305 310 315 320 

Val Ser Ser Thr Asn Gly Glu Leu Asn Thr Asp Asp Pro Thr Ala Asp 
325 330 335 

Val Gin Met His Pro Ser Gin Pro Leu Leu Lys 
340 345 

(2) INFORMATION FOR SEQ ID NO: 34: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 505 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

< ix ) FEATURE : 

(A) NAME/KEY: CDS 

(B) LOCATION: 154.. 1398 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34: 

GAATTCCGAC AGGAAAGCGA TGGTGAAAGC GGGGCCGTGA GGGGGGCGGA GCCGGGAGCC 60 

GGACCCGCAG TAGCGGCAGC AGCGGCGCCG CCTCCCGGAG TTCAGACCCA GGAAGCGGCC 120 

GGGAGGGCAG GAGCGAATCG GGCCGCCGCC GCC ATG GAG CTG AG A GTC GGG AAC 174 

Met Glu Leu Arg Val Gly Asn 
1 5 

AGG TAG CGG CTG GGC CGG AAG ATC GGC AGC GGC TCC TTC GGA GAC ATC 222 
Arg Tyr Arg Leu Gly Arg Lys lie Gly Ser Gly Ser Phe Gly Asp lie 
10 15 20 

TAT CTC GGT ACG GAC ATT GCT GCA GGA GAA GAG GTT GCC ATC AAG CTT 270 
Tyr Leu Gly Thr Asp lie Ala Ala Gly Glu Glu Val Ala lie Lys Leu 
25 30 35 

GAA TGT GTC AAA ACC AAA CAC CCT CAG CTC CAC ATT GAG AGC AAA ATC 318 
Glu Cys Val Lys Thr Lys His Pro Gin Leu His lie Glu Ser Lys lie 
40 45 50 55 

TAG AAG ATG ATG CAG GGA GGA GTG GGC ATC CCC ACC ATC AGA TGG TGC 366 
Tyr Lys Met Met Gin Gly Gly Val Gly lie Pro Thr lie Arg Trp Cys 
60 65 70 

GGG GCA GAG GGG GAC TAC AAC GTC ATG GTG ATG GAG CTG CTG GGG CCA 414 
Gly Ala Glu Gly Asp Tyr Asn Val Met Val Met Glu Leu Leu Gly Pro 
75 80 85 

AGC CTG GAG GAC CTC TTC AAC TTC TGC TCC AGG AAA TTC AGC CTC AAA 462 
Ser Leu Glu Asp Leu Phe Asn Phe Cys Ser Arg Lys Phe Ser Leu Lys 
90 95 100 

ACC GTC CTG CTG CTT GCT GAC CAA ATG ATC AGT CGC ATC GAA TAC ATT 510 
Thr Val Leu Leu Leu Ala Asp Gin Met lie Ser Arg He Glu Tyr He 
105 110 115 

CAT TCA AAG AAC TTC ATC CAC CGG GAT GTG AAG CCA GAC AAC TTC CTC 558 
His Ser Lys Asn Phe He His Arg Asp Val Lys Pro Asp Asn Phe Leu 
120 125 130 135 
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ATG GGC CTG GGG AAG AAG GGC AAC CTG GTG TAG ATC ATC GAG TTC GGG 605 

Met Glv Leu Gly Lys Lys Gly Asn Leu Val Tyr He He Asp Phe Gly 

140 145 150 

CTG GCC AAG AAG TAG GGG GAT GCA GGC ACC CAC GAG CAC ATC CCC TAT 654 
Leu Ala Lys Lys Tyr Arg Asp Ala Arg Thr His Gin His lie Pro Tyr 
155 160 165 

CGT GAG AAC AAG AAC CTC ACG GGG ACG GGG CGG TAG GCC TCC ATC AAC 702 
Arg Glu Asn Lys Asn Leu Thr Gly Thr Ala Arg Tyr Ala Ser He Asn 
170 175 180 

ACG CAC CTT GGA ATT GAA CAA TCC CGA AGA GAT GAG TTG GAG TCT CTG 7 50 

Thr His Leu Gly He Glu Gin Ser Arg Arg Asp Asp Leu Giu Ser Leu 
185 190 195 

GGC TAG GTG CTA ATG TAG TTC AAC CTG GGC TCT CTC CCC TGG GAG GGG 798 
Gly Tyr Val Leu Met Tyr Phe Asn Leu Gly Ser Leu Pro Trp Gin Gly 
20D 205 210 215 

CTG AAG GCT GCC ACC AAG AGA GAG AAA TAG GAA AGG ATT AGC GAG AAG 846 
Leu Lys Ala Ala Thr Lys Arg Gin Lys Tyr Glu Arg He Ser Glu Lys 
220 225 230 

AAA ATG TCC ACC CCC ATC GAA GTG TTG TGT AAA GGC TAG CGT TCC GAA 894 
Lys Met Ser Thr Pro He Glu Val Leu Cys Lys Gly Tyr Pro Ser Glu 
235 240 245 

TTT GCC ACA TAG CTG AAT TTC TGG CGT TCC TTG CGT TTT GAG GAC AAG 942 
Phe Ala Thr Tyr Leu Asn Phe Cys Arg Ser Leu Arg Phe Asp Asp Lys 
250 255 260 

CCT GAC TAG TCG TAG CTG CGG CAG CTT TTC CGG AAT CTG TTC CAT GGC 990 
Pro Asp Tyr Ser Tyr Leu Arg Gin Leu Phe Arg Asn Leu Phe His Arg 
265 270 275 

CAG GGC TTC TCC TAT GAC TAG GTG TTC GAC TGG AAC ATG CTC AAA TTT 1038 
Gin Gly Phe Ser Tyr Asp Tyr Val Phe Asp Trp Asn Met Leu Lys Phe 
280 285 290 295 

GGT GCC AGC CGG GCC GCC GAT GAC GCC GAG CGG GAG CGC AGG GAC CGA 1086 

Gly Ala Ser Arg Ala Ala Asp Asp Ala Glu Arg Glu Arg Arg Asp Arg 
300 305 310 

GAG GAG CGG CTG AGA CAC TCG CGG AAC CCG GCT ACC CGC GGC CTC CCT 1134 
Glu Glu Arg Leu Arg His Ser Arg Asn Pro Ala Thr Arg Gly Leu Pro 
315 320 325 

TCC ACA GCC TCC GGC CGC CTG CGG GGG ACG CAG GAA GTG GCT CCC CCC 1182 
Ser Thr Ala Ser Gly Arg Leu Arg Gly Thr Gin Glu Val Ala Pro Pro 
330 335 340 

ACA CCC CTC ACC CCT ACC TCA CAC ACG GCT AAC ACC TCC CCC CGG CCC 1230 
Thr Pro Leu Thr Pro Thr Ser His Thr Ala Asn Thr Ser Pro Arg Pro 
345 350 355 

GTG TCC GGC ATG GAG AGA GAG CGG AAA GTG AGT ATG CGG CTG CAC CGC 1278 
Val Ser Gly Met Glu Arg Glu Arg Lys Val Ser Met Arg Leu His Arg 
360 365 370 375 

GGG GCC CCC GTG AAC ATC TCC TCG TCC GAC CTC ACA GGC CGA CAA GAT 1326 
Gly Ala Pro Val Asn He Ser Ser Ser Asp Leu Thr Gly Arg Gin Asp 
380 385 390 

ACC TCT CGC ATG TCC ACC TCA CAG ATT CCT GGT CGG GTG GCT TCC AGT 1374 
Thr Ser Arg Met Ser Thr Ser Gin He Pro Gly Arg Val Ala Ser Ser 
395 400 405 
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GGT CTT CAG TCT GTC GTG CAC CGA TGAGAACTCT CCTTATTGCT GTGAAGGGCA 142 8 
Giy Leu Gin Ser Val Val His Arg 
410 415 

GACAATGCAT GGCTGATCTA CTCTGTTACC AATGGCTTTA CTAGTGACAC GTCCCCCGGT 1488 

CTAGGATCGA AATGTTAACA CCGGGAGCTC TCCAGGCCAC TCACCCAGCG ACGCTCGTGG 1548 

GGGAAACATA CTAAACGGAC AGACTCCAAG AGCTGCCACC GCTGGGGCTG CACTGCGGCC 1608 

CCCCACGTGA ACTCGGTTGT AACGGGGCTG GGAAGAAAAG CAGAGAGAGA ATTGCAGAGA 1668 

ATCAGACTCC TTTTCCAGGG CCTCAGCTCC CTCCAGTGGT GGCCGCCCTG TACTCCCTGA 1728 

CGATTCCACT GTAACTACCA ATCTTCTACT TGGTTAAGAC AGTTTTGTAT CATTTTGCTA 1788 

AAAATTATTG GCTTAAATCT GTGTAAAGAA AATCTGTCTT TTTATTGTTT CTTGTCTGTT 1848 

TTTGCGGTCT TACAAAAAAA ATGTTGACTA AGGAATTCTG AGACAGGCTG GCTTGGAGTT 1908 

AGTGTATGAG GTGGAGTCGG GCAGGGAGAA GGTGCAGGTG GATCTCAAGG GTGTGTGCTG 1968 

TGTTTGTTTT GCAGTGTTTT ATTGTCCGCT TTGGAGAGGA GATTTCTCAT CAAAAGTCCG 2028 

TGGTGTGTGT GTGTGCCCGT GTGTGGTGGG ACCTCTTCAA CCTGATTTXG GCGTCTCACC 2088 

CTCCCTCCTC CCGTAATTGA CATGCCTGCT GTCAGGAACT CTTGAGGCCC TCGGAGAGCA 2148 

GTTAGGGACC GCAGGCTGCC GCGGGGCAGG GGTGCAGTGG GTGTTACCAG GCAAAGCACT 2208 

GCGCGCTTCT TCCCCAGGAG GTGGGCAGGC AGCTGAGAGC TTGGAAGCAG AGGCTTTGAG 2268 

ACCCTAGCAG GACAATTGGG AGTCCCAGGA TTCAAGGTGG AAGATGCGTT TCTGGTCCCT 2328 

TGGGAGAGGA CTGTGAACCG AGAGGTGGTT ACTGTAGTGT TTGTTGCCTT GCTGCCTTTG 2388 

CACTCAGTCC ATTTTCTCAG CACTCAATGC TCCTGTGCGG ATTGGCACTC CGTCTGTATG 2448 

AATGCCTGTG GTTAAAACCA GGAGCGGGGC TGTCCTTGCC ACGTGCCAAG ACTAGCTCAG 2508 

AAAAGCCGGC AGGCCAGAAG GACCCACCCT GAGGTGCCAA GGAGCAGGTG ACTCTCCCAA 2568 

CCGGACCCAG AACCTTCACG GCCAGAAAGT AGAGTCTGCG CTGTGACCTT CTGTTGGGCG 2628 

CGTGTCTGTT GGTCAGAAGT GAAGCAGCGT GCGTGGGGCC GAGTCCCACC AGAAGGCAGG 2688 

TGGCCTCCGT GAGCTGGTGC TGCCCCAGGC TCCATGCTGC TGTGCCCTGA GGTTCCCAGG 2748 

ATGCCTTCTC GCCTCTCACT CCGCAGCACT TGGGCGGTAG CCAGTGGCCA TGTGCTCCCA 2808 

ACCCCAATGC GCAGGGCAGT CTGTGTTCGT GGGCACTTCG GCTGGACCCC ATCACGATGG 2868 

ACGATGTTCC CTTTGGACTC TAGGGCTTCG AAGGTGTGCA CCTTGGTTCT CCCTTCTCCT 2928 

CCCCAGAGTT CCCCCGGATG CCATAACTGG CTGGCGTCCC AGAACACAGT TGTCAACCCC 2988 

CCCACCAGCT GGCTGGCCGT CTGTCTGAGC CCATGGATGC TTTCTCAATC CTAGGCTGGT 3048 

TACTGTGTAA GCGTGTTGGA GTACGGCGCC TTGAGCGGGT GGGAGCTGTG TGTTGAAGTA 3108 

CAGAGGGAGG TTGGGGTGGG TCAGAGCCGA GTTAAGAGAT TTTCTTTGTT GCTGGACCCC 3168 

TTCTTGAAGG TAGACGTCCC CCACCCGGAG AGACGTCGCG CTGTGGCCTG AAGTGGCGCA 3228 

AGCTTGCTTT GTAAATATCT GTGGTCCCGA TGTAGTGCCC AGAACGTTTG TGCGAGGCAG 3288 

CTCTGCGCCC GGGTTCCAGC CCGAGCCTCG CCGGGTCGCG TCTTCGGAGT GCTTGTGACA 3348 
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GTCCTTGCCC AGTATCTAGT CCCCGTCGCC CCGTGCAGGA GACGTAGGTA GGACGTCGTG 3408 

TCAGCTGTGC ACTGACGGCC AGTCTCCGAG CTGTGCGTTT GTATCGCCAC TGTATTTGTG 3468 

TACTTTAACA ATCGTGTAAA TAATAAATTC GGAATTC 3505 

(2) INFORMATION FOR SEQ ID NO: 35: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 415 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35: 

Met Glu Leu Arg Val Gly Asn Arg Tyr Arg Leu Gly Arg Lys lie Gly 
15 10 15 

Ser Gly Ser Phe Gly Asp He Tyr Leu Gly Thr Asp He Ala Ala Gly 
20 25 30 

Glu Glu Val Ala He Lys Leu Glu Cys Val Lys Thr Lya His Pro Gin 
35 40 45 

Leu His He Glu Ser Lys He Tyr Lys Met Met Gin Gly Gly Val Gly 
50 55 60 

He Pro Thr He Arg Trp Cys Gly Ala Glu Gly Asp Tyr Asn Val Met 
65 70 75 80 

Val Met Glu Leu Leu Gly Pro Ser Leu Glu Asp Leu Phe Asn Phe Cys 
85 90 95 

Ser Arg Lys Phe Ser Leu Lys Thr Val Leu Leu Leu Ala Asp Gin Met 
100 105 110 

He Ser Arg He Glu Tyr He His Ser Lys Asn Phe He His Arg Asp 
115 120 125 

Val Lys Pro Asp Asn Phe Leu Met Gly Leu Gly Lys Lys Gly Asn Leu 
130 135 140 

Val Tyr He He Asp Phe Gly Leu Ala Lys Lys Tyr Arg Asp Ala Arg 
145 150 155 160 

Thr His Gin His He Pro Tyr Arg Glu Asn Lys Asn Leu Thr Gly Thr 
165 170 175 

Ala Arg Tyr Ala Ser He Asn Thr His Leu Gly He Glu Gin Ser Arg 
180 185 190 

Arg Asp Asp Leu Glu Ser Leu Gly Tyr Val Leu Met Tyr Phe Asn Leu 
195 200 205 

Gly Ser Leu Pro Trp Gin Gly Leu Lys Ala Ala Thr Lys Arg Gin Lys 
210 215 220 

Tyr Glu Arg He Ser Glu Lys Lys Met Ser Thr Pro He Glu Val Leu 
225 230 235 240 

Cys Lys Gly Tyr Pro Ser Glu Phe Ala Thr Tyr Leu Asn Phe Cys Arg 
245 250 255 
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Ser Leu Arg Phe Asp Asp Lys Pro Asp Tyr Ser Tyr Leu Arg Gin Leu 
260 265 270 

Phe Arg Asn Leu Phe His Arg Gin Gly Phe Ser Tyr Asp Tyr Val Phe 
275 280 285 

Asp Trp Asn Met Leu Lys Phe Gly Ala Ser Arg Ala Ala Asp Asp Ala 
290 295 300 

Glu Arg Glu Arg Arg Asp Arg Glu Glu Arg Leu Arg His Ser Arg Asn 
305 310 315 320 

Pro Ala Thr Arg Gly Leu Pro Ser Thr Ala Ser Gly Arg Leu Arg Gly 
325 330 335 

Thr Gin Glu Val Ala Pro Pro Thr Pro Leu Thr Pro Thr Ser His Thr 
340 345 350 

Ala Asn Thr Ser Pro Arg Pro Val Ser Gly Met Glu Arg Glu Arg Lys 
355 360 365 

Val Ser Met Arg Leu His Arg Gly Ala Pro Val Asn lie Ser Ser Ser 
370 375 380 

Asp Leu Thr Gly Arg Gin Asp Thr Ser Arg Met Ser Thr Ser Gin lie 
385 390 395 400 

Pro Gly Arg Val Ala Ser Ser Gly Leu Gin Ser Val Val His Arg 
405 410 415 

(2) INFORMATION FOR SEQ ID NO: 36: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 40 base pairs 
<B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36: 

CTAGATCTAG CTAGACCATG GTAGTTTTTT CTCCTTGACG 40 

(2) INFORMATION FOR SEQ ID NO: 37: 

(i) SEQUENCE CHARACTERISTICS: 
<A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37: 
CATGCCATGG CACGACCTAG T 21 
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(2) INFORMATION FOR SEQ ID NO: 38: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 40 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: CDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 38: 
CTAGATCTAG CTAGACCATG GTAGTTTTTT CTCCTTGACG 40 
(2) INFORMATION FOR SEQ ID NO: 39: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 38 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 39: 
GAATCGGGCC GCCGAGATCT CATATGGAGC TGAGAGTC 38 
(2) INFORMATION FOR SEQ ID NO: 40: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 40: 
CCCGGATCTA GCAGATCTCA T 21 
(2) INFORMATION FOR SEQ ID NO: 41: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE; peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 41: 

Ala Ser Ser Ser Gly Ser Lys Ala Glu Phe lie Val Gly Gly Tyr 
15 10 15 
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(2) INFORMATION FOR SEQ ID NO: 42: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 42: 

Arg Ser Met Thr Val Ser Thr Ser Gin Asp Pro Ser Phe Ser Gly Tyr 
1 5 10 15 



(2) INFORMATION FOR SEQ ID NO: 43: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: CDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 43: 
TACATCTAGA ATTATGGCGA GTAGCAGCGG C 31 
(2) INFORMATION FOR SEQ ID NO: 44: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS: Single 
(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: CDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 44: 
AATGGATCCT TAGAAACCTG TGGGGGT 27 
(2) INFORMATION FOR SEQ ID NO: 45: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 45: 
AATGGATCCT TAGAAACCTT TCATGTTACT CTTGGT 36 
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(2) INFORMATION FOR SEQ ID NO: 46: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 46: 

TACATCTAGA ATTATGGAGC TGAGAGTCGG G 31 

(2) INFORMATION FOR SEQ ID NO: 47: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 27 base pairs 
{ B ) TYPE : nucleic acid 

(C) STBIANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 47: 
GGATCCTCAT CGGTGCACGA CAGACTG 27 
(2) INFORMATION FOR SEQ ID NO: 48: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 48: 
TACATCTAGA ATTATGGCAC GACCTAGTGG TCGATCG 37 
(2) INFORMATION FOR SEQ ID NO: 49: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 49: 
GGGGATCCTA CTTCAGTAGG GGCTG 25 
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(2) INFORMATION FOR SEQ ID NO:50: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLCXjy: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 50: 

Arg Ser Gly His Asn Thr Arg Gly Thr Gly Ser Ser 
15 10 

(2) INFORMATION FOR SEQ ID NO: 51: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 51: 

Arg Leu Gly His Asn Thr Arg Gly Thr Gly Ser Ser 
15 10 

(2) INFORMATION FOR SEQ ID NO: 52: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 52: 

Ser Ser Arg Pro Lys Thr Asp Val Leu Val Gly 
15 10 

(2) INFORMATION FOR SEQ ID NO: 53: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 53: 

Lys Ser Asp Asn Thr Lys Ser Glu Met Lys His Ser 
15 10 
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(2) INFORMATION FOR SEQ ID NO: 54: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 54: 

Gly Thr Asp lie Ala Ala Gly Glu 
1 5 

(2) INFORl^TION FOR SEQ ID NO: 55: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 amino acids 
<B) TYPE: amino acid 
<C) STRANDEDNESS: single 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 55: 

Glu Arg Arg Asp Arg Glu Glu Arg Leu Arg 
15 10 

(2) INFORMATION FOR SEQ ID NO: 56: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 56: 

Thr Gly Lys Gin Thr Asp Lys Thr Lys Ser Asn Met Lys Gly Tyr 
15 10 15 

(2) INFORMATION FOR SEQ ID NO: 57: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 57: 

Asp Leu Leu Gly Pro Ser Leu Glu Asp Leu Phe Gly Tyr 
15 10 
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CLAIMS 

1. An isolated polynucleotide sequence encoding a polypeptide 
with an amino acid sequence having at least about 35% homology in the protein 
kinase domain with the polynucleotide encoding HKR25 protein kinase. 

5 2. The polynucleotide of claim 1 wherein the encoded polypeptide 

possesses casein kinase activity. 

3. The polynucleotide of claim 1 wherein the encoded polypeptide 
possesses protein-serine/ threonine kinase activity. 

4. The polynucleotide of claim 1 wherein the encoded polypeptide 
10 possesses protein-tyrosine kinase activity. 

5. The polynucleotide of claim 1 wherein the encoded polypeptide 
possess protein-serine/thieonine and protein-tyrosine kinase activity. 

6. The polynucleotide of claim 1, wherein the polypeptide is 
characterized as: 

15 a) promoting normal meiotic recombination; and 

b) promoting the repair a DNA strand break which occurs at 
the cleavage site: 

1 

CAACAG 

20 GTTGTC . 

t 

7. The polynucleotide of claim 1, wherein the polypeptide 
comprises an amino acid sequence selected from the group consisting of GPSLED 
(amino acids 86 to 91 in SEQ ID NO: 2), RDIKPDNFL (amino acids 127 to 135 

25 in SEQ ID NO: 2), HIPYIUE (amino acids 164 to 169 in SEQ ID NO: 2), and 
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SVN (amio acids 181 to 183 in SEQ ID NO: 2) and conservative variations 
thereof. 

8. The polynucleotide of claim 1, selected from the group 
consisting of RNA, mRNA, genomic DNA and cDNA. 

5 9. An antisense polynucleotide according to claim 1. 

10. The polynucleotide of claim 1, selected from the group 
consisting of the DNA sequences of SEQ ID NOs: 1, 3, 5, 7, 9, 11, 23, 30, 31, 
and 34. 

11. An isolated and purified polypeptide encoded by a DNA 
10 sequence of claim 10. 

12. The polynucleotide of claim 1 wherein the polynucleotide is 
isolated from organisms selected from the group consisting of Saccharomyces, 
Schizosaccharomyces, human, bovine, porcine, murine, avian and Drosophila 
species. 

15 13. An autonomously replicating DNA vector comprising a DNA 

according to claim 8. 

14. A procaryotic or eukaryotic host cell stably transformed or 
transfected with a DNA according to claim 8. 

15. A method for the production of a polypeptide possessing 
20 protein kinase and/or recombination/repair promoting activity comprising growing 

a host cell according to claim 14 in a suitable nutrient medium and isolating the 
desired polypeptide from said host cell or from the medium of its growth. 

16. A polypeptide product of the method of claim 15. 
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17. An antibcxly substance specific for a polypeptide of claim 15. 

18. A monoclonal antibody according to claim 17. 

19. A method for identifying a composition which modulates the 
protein kinase and/or recombination/repair promoting activity of an HRR25-like 

5 protein comprising: 

(a) incubating a system of components comprising the composition and 
the protein in the presence of a substrate for said protein wherein 
incubation is carried out under conditions sufficient to allow the 
components to interact; and 

10 (b) measuring the change in activity of said protein on said substrate. 

20. The method of claim 19 wherein the activity is promotion of 
repair of a DNA double strand break. 

21. The method of claim 19 wherein the activity is protein kinase 

activity. 

15 22. A method of treating a cell proliferative disorder associated 

with an HRR25-like protein comprising administering, to a subject with the 
disorder, a therapeutically effective amount of a composition which modulates the 
activity of the protein. 
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Hrr 2 5 MDLRVGRKFRIGRKIGSGSFGDI YHGTNLISG EEVA 

YckX /Cki2 SSRDDSTIIGMYKIGKKIGEGSFGVLFEGTNMING VPVA 

Yck2 /Cki 1 -SGSQSRDDSTIIGLHYKIGKKIGEGSFGVIiFEGTNMING I*PVA 

Nuf 1 MSQRSSQHIVGIHYAVGPKIGEGSFGVIFEGENILHSCQAOTGSKRDSSIIMANEPVA 

Hhp 1 MAU5LRIGNK YRIGRKIGSGSFGD I YLGTNWSG EEVA 

Hhp2 MTWDIKIGNKYRIGRKIGSGSFGQIYLGLNTVNG EQVA 

CKIalHU MASSSGSKAEFIVGGKYKLVRKIGSGSFGDIYIAINITNG EEVA 

CKIa2HU MASSSGSKAEFIVGGKYKXiVRKlGSGSFGDIYLAINITNG EEVA 

CKXa3Hu MASSSGSKAEFIVGGKYKI.VRKIGSGSFGDiyLAINITNG EEVA 

common G-KYKIGRKIGSGSFGDIY-GTN — NG E-VA 



Hrr2 5 iki*esirsrhpqi.dyesrvyrymggvgiffirwfgregeynamvidixgpsledlfhych 

Yckl /Cki2 ikfeprkteapquideyktykilngtpnifyayyfgqeglhnii.vidllgpsi*edlfdwcg 

Yck2 /cki 1 IKFEPRKTEAPQLKDE YRTYKIIAGTPGIFQEYYFGQEGLHNILVIDULGPSIiEDLFDWCG 

Nuf 1 IKFEPRHSDAPOLRDErRAYRII*NGCVGlPHAYYrGQEGMHNILIIDLI.GPSIiEDLFEWCN 

Hhp 1 IKLESTRAKHPQLE YE YRVYRILSGGVGIPFVRWFGVECDYNAMVMDLLGPSLEDLFNFCG 

Hhp2 VKI.EPLKARHHQLEYEFRVYNIUCGNIGIPTIRWFGVTNSYNAMVMDI.LGPSLEDLFCYCG 

CKIalHu VKLESQKARHPQI-I-YESKLYKILQGGVGIPHIRWYGQEKDYNVLVMDLLGPSI.EDLFNFCS 

CKIa2Hu VKr.ESQKARHPQI*LYESKI.YKILgGGVGIPHIRWYGQEKDyNVI.VMDLLGPSI*EDLFNFCS 

CKIa3Hu VKI.ESQKARHPQLLYESKLyKILQGGVGIPHIRWyGQEKDYNVLVMDLI.GPSLEDLFNFCS 

Common IKLEP-KA-HPQL-YE-RVYKII^G-VGIP — RWFG — G-YNALVIDIXGPSLEDLF — CG 

Hrr2 5 RRFSFKTVIMLALOMFCRIOYIHGRSFIHRDIKPDNFLMG — VGRRGST 

Yckl /Cki2 RKFSVKTWQVAVQMITLIEDLHAHDLI YRDIKPDNFLIGRPGQPDANN 

Yck2 /Cki 1 RRFSVKTVI.LlJU>QLITLIEDLHAHDr.IYRDIKPDNFLIGRPGQPDANK 

Nuf 1 RKFSVKTTCMVAKQMIDRVRAIHDHD^-IYRDIKPDNFLISQYQRISPEGKVIKSCASSSNN 

Hhpl RKFSX-KTVIXLADOIilSRIEFIHSKSFIiHRDIKPDNFLMG XGKRGNQ 

Hhp2 RKFTUCTVtLrADQLISRIEYVHSKSFLHRDIKPDNFLM KKHSNV 

CKIa IHu RRFTMKTVLMLADOMISRIEYVHTKNFIHRDIKPDNFLMG — IGRHCNK 

CKIa2Hu RRFTMKTVLKLADQMISRIEYVHTKNFIHRDIKPDNFLHG — IGRHCNK 

CKIa3Hu RRFTMKTVUCADOMISRIEYVHTKNFIHRDIKPDNFLMG — IGRHCNKCI*ESPVGKRKRS 

Common RRFS-KTVI.MI-ADOMISRIEYIH — DFIHRDIKPDNFLMG G N 

Hrr2 5 VHVIDFGLSKKYRDFNTHRHIPYRENKSLTGTARYASVNTHLGIE 

Yckl /Cki2 IHLIDFGMAKQYRDPKTKQHIPYREKKSLSGTARYMSINTHIGRE 

Yck2 /Ckil VHLIDFGMAKQYRDPKTKQHIPYREKKSLSGTARYMSINTHIGRE 

NU f 1 NDPNL lYMVDFGMAKQ YRDPRTKQHI P YRERKSLSGTARYMS INTHFGRE 

Hhpl VNIIDFGrAKKYRDHKTHLHIPYRENKNI.TGTARYASINTHLiGIE 

Hhp2 VTMIDFGlAKKYRDrKTHVHIPYRDNKMI.TGTARYASINTHIGIE 

CKIa IHU LFLIDFG1AKKYRDNRTRQH1PYREDKNI*TGTARYASINAHLGIE 

CKIa2HU I1FLIDFGIAKKYRDNRTRQHIPYREDKMI.TGTARYASINAHLGIE 

CKIaSHU MTVSTSQDPSFSGLNQLFLIDFGIAKKYRDNRTRQHIPYREDKNLTGTARYASINAHLGIE 

Common VHLIDFGLAKKYRDPKTHQHIPYRENKSLTGTARYASINTHLGIE 



FIGURE 2A 
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Hrr25 QSRIU>DLESLGYVLIYFCKGSLPWQGLKATTKKQKYDRIMEKKXiNVSVETI.CSGI.PL EF 

yckl/Cki2 QSRRDDMEALGHVrFYFLRGHLPWQGLQAPNNKQKyEKIGEKKRI.TNI*YDLAQGLPV — QF 

yck2 /Cki 1 QSRRDDMEAMGHVFrYFLRGQLPWQGI*QAPNNKQKYEKIGEKKRLTNIiYDIAQGI.PI QF 

Nuf 1 QSRIU>DLESIiGHVFrYFLRGSI.PWQGI<KAPNNKI*KyEKIGMTKQKLNPDDrJ-I.imAIPYQF 

Hhp 1 OSRRDDLESLGYVLVYFCRGSLPWQGIAATTKKQKYEKIMEKKISTPTEVIiCRGFPQ EF 
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